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Abstract 

Consider the problem of a multiple access channel with a large number of users. While several multi¬ 
user coding techniques exist, in practical scenarios, not all users can be scheduled simultaneously. 
Thus, a key problem is which users to schedule in a given time slot. Although the problem has 
been studied in various time-independent scenarios, capacity scaling laws and algorithms for time- 
dependent channels (e.g., Markov channels) remains relatively unexplored. The basic assumption 
that users’ capacities are i.i.d. random variables is no longer valid, and a more realistic approach 
should be taken. The channel observed by a user is time varying, hence its distribution changes 
in time, i.e., a channel with memory. Since intelligent user selection has significant advantages, 
analyzing the distribution of the maximal capacity seen by a user in the system can give a good 
approximations for the capacity scaling law. 

In this work, we consider a channel with two channel states, Good and Bad, where in each point 
in time a user is associated with one of them. The process of moving between states is modeled 
by a Markovian process (Gilbert-Elliott Channel). First, we derive the expected capacity under 
scheduling for this time dependent environment and show that its scaling law is O ( rr g y 7 2 log K + 
fig), were er g ,fj, g are the good channel parameters (assuming Gaussian capacity approximation, 
e.g., under MIMO) and K is the number of users. Analysis uses Extreme Value Theory (EVT) 
along with finding the suitable normalizing constants. 

In addition, a distributed algorithm for this scenario is suggested along with it’s channel ca¬ 
pacity analysis. The algorithm is threshold-based and the rate for exceeding it is analyzed using 
Point Process Approximation (PPA). The analysis is done using the properties of the extremes 
of chain dependent sequences, which is proved in this study to converge to one of the extreme 
value distributions. The expected capacity, while imposing the discussed algorithm, scales as 
O (e -1 (er gA /2 log K + n g )), hence, there is no loss in optimality due to the distributed algo¬ 
rithm. The foundation for extending this study to more channel states is set and can be easily 
done. 

Finally, we turn to performance analysis of such systems while assuming the users are not neces¬ 
sarily fully backlogged, focusing on the queueing problem and, especially, on the strong dependence 
between the queues. We adopt the celebrated model of Ephremides and Zhu to give new results on 
the convergence of the probability of collision to its average value (as the number of users grows), 
and hence for the ensuing system performance metrics, such as throughput and delay. We further 
utilize this finding to suggest a much simpler approximate model, which accurately describes the 
system behavior when the number of queues is large. The system performance as predicted by the 
approximate models shows excellent agreement with simulation results. 
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Chapter 1 

Introduction 


Numerous innovations, such as sophisticated coding techniques and advances in the endpoint equip¬ 
ment, have been suggested over recent years in order to cope with the high traffic demands over 
the last-hop wireless medium. Yet, the rapid growth in the number of users and devices which are 
connected via this last mile wireless access architecture requires efficient scheduling techniques as 
well, since in practical scenarios, not all of them can be scheduled together. Indeed, scheduling 
schemes have been widely studied is such multiuser environments, trying to exploit the multiuser 
diversity which is inherent in wireless medium, by scheduling users when their instantaneous channel 
quality is at its best. Utilizing multiple antennas both at the transmitter and the receiver has also 
been adopted by state of the art wireless standards such as 802.lln, 802.11ac and LTE advanced. 
Multiuser diversity strategies under such Multiple-Input Multiple-Output (MIMO) technologies 
have been exploited in many scheduling studies, e.g., in 0. i. where Zero-Forcing Beamforming 
(ZFBF) was investigated and user selection was performed in order to avoid interference among 
users streams. 

Scheduling schemes can be categorized as centralized , typically requiring high overhead in col¬ 
lecting channel reports from many users by the central entity (e.g., access point or base station), 
in order to perform informed user selection, or distributed , which are designed such that the users 
can schedule themselves. One such opportunistic distributed scheme which has shown excellent 
exploitation of multiuser diversity, lets the users transmit based on their instantaneous channel 
condition. Specifically, users may transmit only if their channel gain is above a given threshold. 

While the problem in its many forms, as described above, has been studied in various time- 
independent scenarios, capacity and algorithms for time-dependent channels (e.g., Markov channels) 
remained relatively unexplored. We consider the situation for the upstream traffic in a distributed 
manner with the assumption that a user experiences a time varying channel, therefore a time depen¬ 
dent model is suggested, which reflects the time varying channel distribution. Using this model and 
its properties we were able to achieve closed analytic expressions in different ways for the channel 
capacity while using two statistical tools, Extreme Value Theory (EVT) and Point Process Approx¬ 
imation (PPA). Those enabled us to examine the limit distribution of the maximal capacity value 
for the transmitting user, i.e. multi-user diversity. We also looked on a model that describes the 
segmentation of users according to the different channel states they exist in, and tried to obtain 
also the expression for the channel capacity. 

Obviously, a distributive paradigm in which users schedule themselves for transmission can re¬ 
sult in collisions between users, which, in turn, affects channel utilization and quality of service 
(QoS). In this work we are interested in characterizing these important metrics. We study users’ 
performance under realistic traffic patterns, in which users are not fully backlogged and the arrival 
process of each user is a random process. Specifically, we assumes a realistic model in which the 
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arrival process of each user is modeled as a Poisson process. Each user transmits packets in a 
hrst-in-first-out (FIFO) manner, such that packets arriving to a user which is already busy with a 
pending packet transmission, wait for their turn to be transmitted. Accordingly, each user maintains 
a queue in which it stores packets waiting for transmission. We study performance metrics other 
than capacity, such as delay and buffer occupancy. We suggest a general model, consisting with a 
system of K users in a random, multiple-access system who interact and affect each other. This 
theoretical problem was studied in the past and it has been shown that the interaction between the 
queues in such systems makes the analysis inherently difficult. Thus there is no clear understanding 
of the performance of such systems with large number of users (more than 3), due to the complexity 
in describing the strong interdependence between the user’s queues. Only approximate models were 
developed in order to describe the system behaviour. 

This work is organized as follows. In Chapter [2} literature survey is given which will review 
related work in the subject. In Chapter [3] we will review some of the relevant materials for better 
understanding the problem and it’s properties. When part of which will cover some results con¬ 
cerning EVT and PPA. Those are necessary to carry out the asymptotic analysis of the channel 
capacity. We will elaborate on results that are relative for time dependent processes (stationary 
sequences), which we shall use in our time dependent situation. In Chapter [4] we will describe our 
model of the system and basic assumptions for this work. As mentioned before, this work is divided 
to two principal areas of research, the first is given in chapter [5j there we will focus on the design 
and analysis of an algorithm which schedules users in the time-dependent regime in a distributed 
manner. The algorithm is threshold based, and takes the advantages of Multi-User diversity in its 
operation. The resulting channel capacity is given analytically, simulations and bounds are also 
given. In chapter [6j the second area will be given, which will examine the performance analysis of 
the queues in our distributed multiple-access system. Models for the independent and dependent 
time scenarios is presented along with numerical results. 

1.1 Main Contributions 

• In chapter [5j we start with giving a closed expression for the channel capacity scaling laws, 
in a centralized time-dependent environment, while the best user, with the highest channel 
quality, is chosen to utilize the channel. Then we present a distributed algorithm for the 
users scheduling mechanism, and the time dependent channel capacity scaling laws under 
this algorithm. During the analysis of the distributed algorithm, we show a proof for the 
extreme values convergence of our chain dependent process to one of the EVT distributions, 
while using the dependency condition D and D' for stationary sequences. And as part of 
this convergence, we derive the normalizing constants for the convergence to extreme value 
distribution, where the marginal distribution is a mixture of normal distributions. All of these 
analytical developments, set foundations for the extension of the channel model to more than 
two states, and finding the maximal value distribution for this case to derive the expected 
channel capacity. 

• In chapter [6j we present approximate models for the systems’ queues behaviour. We first 
concentrate and review the model presented in 13 , while fitting it to our setting. Numerical 
results and interesting observations are presented for this elaborated model. Then, we present 
a simpler approximate model which in contrary to the former, is able to describe the systems’ 
behaviour when the number of queues is large. Lastly, we suggest a time dependent queue 
model which relates to our time dependent channel and shows very good agrement with 
simulations results. 
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Chapter 2 

Related Work 


Two of the most important results in network information theory are the capacity of the Gaussian 
multiple-access channel (MAC) and the capacity of the Gaussian broadcast channel (BC). The de¬ 
sire to maximally utilize both channel capacities is in the heart of many of the studies dealing with 
shared medium communication. As mentioned in the pervious chapter, scheduling and user selec¬ 
tion is a crucial part in achieving these capacities when it comes to large population. In this study 
we are dealing with the Gaussian MAC model, nevertheless many ideas from works concerning the 
Gaussian BC are extremely relevant such as the utilization of multi-user diversity and the use of 
order statistic (EVT) for statistical deductions. In the following sections we review some of state of 
the art works concerning both types of channels and the attempts to obtain the optimal capacity 
by using different methods for scheduling and user selection along with variety of statistical analysis. 

The communication system considered in this work is single user MIMO - Multiple Input Mul¬ 
tiple Output. Basically it means to use multiple antennas at both the transmitter and the receiver 
in order to increase data rates through multiplexing or to improve performance through diversity. 
Multiplexing is obtained by exploiting the structure of the channel gain matrix to obtain indepen¬ 
dent signalling paths that can be used to send independent data. 

By using this system we can achieve high data rate without increasing the total transmission power 
or bandwidth. Additionally the use of multiple antennas at both the transmitter and receiver pro¬ 
vides significant increase in the capacity [4}j7 . MIMO has been supported by several novel wireless 
standards such as 802.11ac and LTE Advanced. 


2.1 Scheduling Algorithms 


While considering a large multi-user networks, each user will experience different channel state 
which caused by various conditions. This diversity in users’ channel state may be used to enhanced 
the ergodic capacity of the channel. In 8], it was shown that in order to improve spectral efficiency, 
at any given time, only the user (or users) with the strongest signal should transmit over the entire 
bandwidth. Hence, we must choose intelligently the user with minimum complexity and overhead. 
The work of |9i improved the work of [§] when considering delay aspects, when they treat the 
situation of starvation. Another work which strengthened the fact that appropriate user selection is 
essential, and in several cases can even achieve optimality with sub-optimal detectors is 
discussed various user selection methods in several MIMO detection schemes. 


10 . which 


2.1.1 Broadcast Channel 

Dirty Paper Coding (DPC) for Gaussian broadcast channel, have proven to be the optimal solution 
for achieving the channel capacity 11 . In practice this technique is not used as it involve complex 
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computations and it is not feasible for large population of users. Hence, with the use of multi-user 
diversity, simpler methods for achieving channel capacity ware derived. In |2], the researchers in¬ 
vestigate a technique to achieve the MU-MIMO capacity called Zero-Forcing Beamforming (ZFBF). 
The paper investigates if, for a large number of users, simpler schemes can achieve the same per¬ 
formance as dirty paper coding (DPC) technique. 

Beamforming (BF) is a strategy that can serve multiple users at a time, where each user stream 
is coded independently and multiplied by a beamforming weight vector for transmission through 
multiple antennas. By choosing the weight vectors mutual interference between different streams 
could be reduced or disappear, allowing spatial separation between users and as a result support 
multiple users simultaneously. In the paper the researchers have managed to present a low com¬ 
plexity algorithm for semi orthogonal user selection, which achieves a sum rate close to the optimal 
rate given by DPC. In addition they presented a fairly scheduling scheme that is asymptotically 
optimal and has good performance. 

As BF and DPC show similar performances of the sum rate capacity, a comparison was made with 
time sharing capacity in 12 , and the researchers’ conclusion was that the sum rate of DPC and 
BF outperform of the time sharing if the transmitter is equipped with multiple antennas. In 13 


asymptotically optimal scheme for downstream traffic for coordinated beamforming where multiple 
BSs jointly optimize their respective beamforming vectors was given. In l], another scheduling 
algorithm over a MIMO broadcast channel is introduced. Given a set of users, the scheduler selects 
more than one user and transmits independent data to them simultaneously by using zero-forcing 
beamforming. Taking computational complexity into account, a greedy method finding the best and 
most orthogonal channel vectors is proposed. Additionally, considering fairness among asymmetric 
users, the researchers also propose an asymptotically fair scheduling algorithm. The technique, 


opportunistic beamforming and nulling, was introduced in 14 .It is used to induce fast and large 
fluctuations in the SINR of the users so that multiuser diversity can still be exploited . This 
technique amplifies the possible multiuser diversity gain while satisfying reasonable latency require¬ 
ments. In 15 , low complexity scheme are considered and compered in means of complexity and 
It is shown that under quiet simple settings linear beamforming achieves the best 

the authors suggested opportunistic beam¬ 


performance. 

tradeoff between performance and complexity. In 16 


forming and scheduling schemes with different linear combining techniques. The authors used EVT 
to derive throughput and scaling laws of the effective signal-to-interference ratios. 


In 17 , a scheduling scheme in TDMA system is suggested which uses the Wishart channel 


matrix HH' eigenvalues analysis. Another work regarding user selection is 18 . The paper consid¬ 


ers the benefit of opportunistic channel state dependent scheduling which can provide significant 
performance gains for wireless networks by exploiting the independence of fading statistics across 


the user population. A combination of spatial multiplexing with multiuser diversity is given in 19 


The researchers develop an optimal cross-layer scheduling mechanism that executes fair scheduling 
at the upper layer and optimal antenna assignment at the physical layer. For fair scheduling, they 
propose a framework that achieves the objective of maximum capacity and proportional fairness. 
For optimal antenna assignment, they consider the Hungarian algorithm that maximally utilizes 
the characteristics of MIMO systems by adopting the graph theoretical approach. 

20,21 


More relevant works are 


and a survey on limits of MIMO BC can be found in 22 


The characteristics of Quality of Service (QoS) parameters can also be used as appropriate 
scheduling decisions rules, in 23 , a multi-user downlink data scheduler with quality-of-service 


(QoS) provisioning was introduced. In addition to algorithms that are aware of both channel and 
queue state information to achieve the maximum aggregate utility in the network. Another scheduler 
using queue-length-based scheduling is introduced in 24 , which implements a congestion control 


mechanism either at the base station or at the end users and can lead to fair resource allocation 


and queue-length stability. In 25 a practical low-complexity scheduling strategy was suggested by 


users’ queue state, along with sub-optimal multiplexing strategy. 
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Another relevant work which use order statistics is 26 , where the authors suggested a subcarrier 


assignment algorithm for Orthogonal Frequency Division Multiplexing (OFDM) based systems, and 
used order statistics to derive an expression for the resulting link outage probability. Order statistics 
(or EVT) is also required when one wishes to determine the distribution of the selected users, rather 
than the a priori distribution. 

As we can see appropriate user selection is essential for achieving best utilization of the channel 
capacity. 


2.1.2 Multiple-Access Channel 

Centralized scheduling systems can use information given to them from the users in order to preform 
better scheduling decision and pass them along to the users. In 27 , a framework for opportunistic 


scheduling over multiple wireless channels is introduced. Users are selected for data transmission 
at any time, though with different throughput and system resource requirements. The article intro¬ 
duced and analyzed Multi-channel Fair Scheduler (MFS), the first wireless scheduling algorithm that 
provide long term deterministic (MFS-D) and probabilistic (MFS-P) fairness guarantees respectively 
over multiple wireless channels. In [28] , the writers present a new low complexity user selection 
algorithm based on the adaptive Markov chain Monte Carlo (AMCMC) optimization technique to 
maximize the sum capacity in an uplink MIMO multiuser system. With the proposed algorithm, 
99% of the optimal capacity obtained by exhaustive search method can be achieve. 

In 29 , a novel scheduling metric was derived based on ergodic sum capacity. A per-user receiver- 


side correlated channel model was combined with a second-order Taylor series of the ergodic sum 
capacity as a function of SNR. Thus, an ergodic capacity approximation can be written as an ex¬ 
plicit function of the channel model parameters, i.e., the receiver-side channel correlation matrices. 
This approximation can directly serve as a measure of spatial compatibility among a given set 
of users, i.e., as a scheduling metric. The scheduling metric presented in this work differs from 
previously proposed metrics mainly in the fact that no instantaneous channel state information is 
required. Long-term channel statistics are sufficient, implying scheduling robustness and reduced 
effort in channel sounding. In 30 , an effective MIMO precoding scheme to handle the inter¬ 


cell interference for uplink communication in a multicell environment is introduced. The precoding 
scheme allows to select the number of transmit streams adaptively to the surrounding environments. 


Distributed Scheduling 


In contrast to the centralized approach, distributed scheduling system decisions is preformed au¬ 
tonomously relying solely on information available at the user. The basic assumption is the the 
users has all information they need in order to preserve the integrity of the algorithm and updates 
from the BS are negligible. 

In '31, 32 , a decentralized MAC protocol for Orthogonal Frequency Division Multiple Access 


(OFDMA) channels is given, it is a threshold based algorithm and each user estimates his channel 
gain and compares it to the threshold, in case of exceedance the user can transmit. The optimal 
threshold indeed maximizes the number of slots in which only one user exceeds it. This distributed 


scheme achieves 1/e of the capacity which can be achieved by scheduling the strongest user. 33 
extended the distributed threshold scheme to multi-channel setup, where each user competes on m 
channels and examined proportional fairness in heterogeneous systems. In 34 , the authors used a 


similar approach for power allocation in the multi-channel setup, and suggested an algorithm that 
asymptotically achieves the optimal water filling solution. To reduce channel contention, 31 35 


introduced a splitting algorithm which resolves collision by allocating several mini-slots devoted to 
finding the best user. Assuming all users are equipped with a Collision Detection (CD) mechanism, 
the authors also analyzed the suggested protocol for users not fully backlogged, where the packets 
randomly arrive with a total arrival rate A and for channels with memory. 36 used a similar split¬ 


ting approach to exploit idle channels in a multichannel setup, and showed improvement of 63% 
compared to the original scheme in 31 .In 37 , the authors perform a joint optimization of both 
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the thresholds and the access probabilities, which provides a fair allocation that achieves a good 
tradeoff between total throughput and fairness. 38 extended the suggested scheme in ;31| and gave 
a novel way to analyse such threshold-based algorithms, the Point Process Approximation (PPA), 
and apply it to non identically distributed scenarios as well. 

Extreme Value Theorem - one of the ways to analyse such threshold exceedances is to obtain 
the distribution of the maximum channel gain. A convenient tool for analyzing this kind of extreme 
distributions is the Extreme Value Theorem (EVT). As the number of users rises (K —$■ oo) the 
distribution function of the maximal capacity degenerates, therefore EVT helps to obtain the limit 
distribution for large number of i.i.d. users. Prior work which is using this tool and refer to these 
maximal distributions can be found in the article of Choi and Andrews (2008) |39| , where we can find 
the scaling laws of maximal base station scheduling via Extreme Value Theory (EVT). By scheduling 
the user with the strongest channel among K users one can gain a factor of 0 {\/2 log K) in the 
expected capacity compared to arbitrary (random, Round-Robin) scheduling. Works mentioned in 
this chapter such as 16 26,38 , also used EVT for statistical deductions. EVT has been proved 
useful for analyzing reliable broadcasting as well [40||4l]. In this work EVT is used in the context 


of stationary process since this work, in contrary to the works presented above, assume the realistic 
assumption of time dependency. 


2.2 Time Dependent Channel 


Since this study extends the problem to the case which the users experience a time varying channel, 
the system model should be described as a system with memory which has different channel states. 
In the article of Gillbert (1960) [42], we can find a model for burst-noise binary channel, where 
the channel uses a Markov chain in order to illustrate a case of error free with a good state and a 
case of probability for error in transmission with a bad state. The channel capacity for this model 
is derived. Proceeding to the article of Gillbert , in the article of elliott (1963) [43], a means for 
estimating error rates for binary block codes was introduced. In 44 , the researchers form a finite- 


state Markov channel model to represent Rayleigh fading channels using the expended model of 
Gillbert-elliott. They develop and analyze a methodology to partition the received signal-to-noise 
ratio (SNR) into a finite number of states according to the time duration of each state. Each state 
corresponds to different channel quality indicated by bit-error rate (BER). The number of states 
and SNR partitions are determined by the fading speed of the channel. In 45 a review on the 


development of fading channel (i.e. channels with states) concerning statistical models, capacity 
in terms of single user and multiuser transmission and more is presented. More works which are 


highly relevant are 46 47 dealing with the capacity of a time dependent channel and the optimal 


power allocation under average power constraints. 


2.3 Limit laws in time dependent channels 


The study on the limit laws for i.i.d. random variables was fully characterized in many works 
starting with 48 , therefore studies regarding stationary and non stationary sequences and their 
extreme laws became the natural extension for this problems. One interesting case is the chain 
dependent process which aligns with our desire to model the users’ channel as described above. 
Chain dependent process was first introduced by Janssen (1969, j'49|) and it is known as the J — X 
process, where we have the bivariate random variables sequence (J n ,X n ),n A 0. The sequence X n 
depends on the state which the process J n exist in. Exact formulation for the problem will be given 
later. We will review some relevant work in this subject. In 50] we can find characterization for 
the limiting behavior of the maximal value distribution of the sequence X n while conditioning on 
J n . With suitable normalization constants, the exact solution to the distribution of the maximum 
value is given. This solution is computationally difficult and was formed by using the theory of 
semi-Markov processes, which employs similar formulation for matrices as formed by the J — X 
process definition, in order to approximate the limiting distribution laws. In addition they establish 
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basic properties of the normalizing constants, but did not give a specific way to derive them. In 51 


and in its preceding work 52 necessary and sufficient conditions are found for the distribution of 
the maximal value to be relatively and almost surely stable. One important work is 53 which 


stated that the limiting behaviour of the maximal value is not dependent on the initial distribution 
of the underlying Markov chain, thus the sequence J n may start with the stationary distribution 
of the chain which make X n also stationary process. In addition the strong mixing property of 
chain dependent sequences was also given leading eventually to the fact that under fairly general 
conditions the limiting behavior of the maxima for chain dependent processes is identical to that 
for the i.i.d. process with the same marginal distribution. In 

dependency conditions of the X n sequence known as D and D' which were given in 


54 and in its preceding paper 55 


56 were used 


while achieving the results for the limiting distributions for chain dependent process. Properties for 
a suitable normalizing constants were also given under general conditions, in order for the limiting 
behaviour will converge to one of the limiting distributions introduced in 48 . Rate of convergence 


for these approximations for the limiting convergence can be found in 57 . It is important to mention 


that all the studies above showed convergence to one of the three limiting distribution given in 48 


and in some we have specific characterizations for achieving the normalizing constants. In this work 
the channel model is based on chain dependent process and results regarding the maximal value 
behavior is relevant for understanding the channel capacity in our scenario, which make the studies 
above extremely relevant. Similarly to 55 we used the long range and local dependence condition 
D and D' respectively, for finding the extreme laws for our sequence. However the assumptions 
made in 55 regarding the marginal distributions are more strict comparing to the ones we did 


resulting with different normalizing constant for the convergence. 


2.4 Queueing system 


The analysis of the performance evaluation of our random multiple-access system is an extremely 
interesting and important theoretical problem. The interaction between the queues in such systems 
makes the analysis to be inherently difficult 58 , and the importance of such analysis was recognized 


59 . As will be described in the sequel, we are dealing with a slotted time system and therefore 


the most notable system is the slotted aloha. This system was widely studied in the past with two 
major aspects, the stability region of the system and its QoS properties and performance. Where 
the first aspect relate to the problem of finding the values of the arrival rates so the system will be 
stable, and the second aspect relate to the analysis of such system in terms of delay, mean queue size 
and more. Here we will not engage the stability problem, since existing results are known for the 
cases of K = 2 60 61 and K = 3 62 , and also sufficient bounds on the arrival rates can be found 


60 63 64 for K > 3 in order to maintain stability. In fact, we will investigate the performance 


of such system which have appeared in many works. 

Even toady there is no clear understanding of the performance of such systems with large number 
of users (more than 3), due to the complexity in describing the strong interdependence between the 
user’s queues. Only approximate models were developed in order to describe the system behaviour. 
In [65] , an iterative approximation model was suggested while using decoupled Markov chains. A 
refined model was presented in 3]. In 66", the mean delay was given for the case of two identical 


users, as well as an approximation for a larger population. Extension for slotted CSMA/CD model 
can be found in 67 . 
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Chapter 3 

Preliminaries 


This chapter offers the required technical background which will be used throughout this work. 
Starting with the Gaussian channel through the definition of a MIMO system and the its capacity 
under Gaussian channels, statistical tools for analysing independent and dependent sequences of 
random variables and the review of results regarding sequences which depends on an underlying 
Markov chain. 


3.1 The Gaussian Channel 


In information theory the communication between two points means a physical process which the 
information transfers through it, therefore it is subject to uncontrollable ambient noise and im¬ 
perfections of the physical signalling process itself. In 68 it is proven that the channel capacity 


is the tightest upper bound on the amount of information that can be reliably transmitted over a 
communication channel, while taking into account the interferences. 

Our communication system is MIMO, and between each pair of antennas the channel model is a 
Gaussian channel which is defined as follows: 


Y = X + Z (3.1.1) 

Where Y is the output of the channel with input X and with an independent noise Z with variance 
N. Without any conditions the capacity of this channel may be infinite Since X can take any value. 

If the variance of the noise is zero, then the receiver receives the transmitted symbol perfectly, 

whereas if the variance of the noise is non-zero we can choose an infinite subset of inputs so that 
they are distinguishable at the output with arbitrarily small probability of error. Such a scheme 
has an infinity capacity as well. 

The most common limitation on the input is an energy or power constraint: 

E[X 2 ] < P (3.1.2) 

And so the capacity of such channel is: 

C = Elxf<F I(X: ‘ Y) = 2 l0g (1 + ^ (3 - L3) 

The maximum is attained when X ~ A/"(0, P ). 


9 



3.2. Capacity of MIMO 


Chapter 3. Preliminaries 


3.2 Capacity of MIMO 


Consider a MIMO system with t transmit antennas and r receives antennas as illustrated in Figure 
|3.1| The linear link model between the transmit and receive antennas can be represented in the 
vector notation as 


y=Hx+n 


(3.2.1) 


where y is the r x 1 received vector from the t x 1 input vector x. n is r x 1 zero mean complex 
Gaussian noise vector with independent, equal variance real and imaginary parts. And H is a r x t 
normalized channel matrix, 



hi ,2 ■ 

hi,t\ 

hv.i 

h 2,2 

h2,t 

\h e , 1 

h r ,2 

h r ,t j 


(3.2.2) 


Each element hij n represents the complex gains between the n th transmit and m th 


receive antennas. 


Tx 

End 


Figure 3.1: MIMO channel 



3.2.1 multi-antenna Gaussian channels 


After describing the system and the capacity of a single Gaussian channel we would like to explore 
the capacity of the system. In [2] the researcher investigates the use of multiple antennas at the 
receiver and at the transmitter for single user communication over the additive Gaussian channel, 
with or without fading, and calculates the capacity of such a system. The linear model of the system 
is as given in (3.2.1), where now y £ C r , x £ C r . The transmitter is constrained in its total power 
to P, 

E[x ! x] < P. (3.2.3) 


The matrix H is a random matrix when all its entries are i.i.d. Gaussian variables with zero-mean 
independent real and imaginary parts, each with variance 1/2. This matrix models the Rayleigh 
fading environment of the channel. In j4], the researcher focus in the scenario which the matrix H 1 
is a random matrix, but in order to expose the techniques used to calculate the channel capacity 
he consider the case which H is deterministic. 

Using the singular value decomposition and consider H is deterministic we will derive the capacity 
of this cannel. 

This is a vector Gaussian channel, We can compute the capacity by decomposing the vector channel 
into a set of parallel, independent scalar Gaussian sub-channels using SVD. 

The matrix H can be written as H = UDV' where U £ C rxr and V £ C txt are unitary, and 
D £ M. rxt is non-negative and diagonal (the diagonal elements are the ordered singular values of 
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the matrix H). 

Therefore (4) can be written like this: 

y = UDV* x+n (3.2.4) 

because U and v are invertible we can write: 

rfy = DY<x+ rfn (3.2.5) 

and we mark it as: y=Dx+n , where n has the same distribution as n. 

Since H is of rank at most min{r,t }, at most min{r,t} of the singular values of it are non zero. 
In addition ||£|| 2 = ||a:|| 2 , thus the energy is preserved and we have equivalent representation as a 
parallel Gaussian channel: 

y t = A A + Hi, 1 <i< min{r, t} (3.2.6) 

So the capacity can be calculated as a system of parallel independent channels: 

C = ]Tlog(l+^^) (3.2.7) 

i 

where Pi*,..., PVr t i are the waterfilling power allocations: Pi* = (// — ^ 2 ) + , with y chosen to 
satisfy the total power constraint E,P,;* = P. 

In the case which H is random, it is shown that the capacity of the channel is achieved when 
a; is a circularly symmetric complex Gaussian with zero-mean and covariance The capacity is 
given by 

P [log (det(I r + yPP t ))], (3.2.8) 

as t grows the expression jHH^ —> I r almost surely (law of large numbers) and therefore the 
capacity is 

r log (1 + P). (3.2.9) 

The way to derive the capacity is by maximizing the average mutual information X(x\ y). 

In 5 | we can find an overview of the extensive recent results on the capacity of MIMO channels 
for both single-user and multiuser systems. The great capacity gains predicted for such systems can 
be realized in some cases, but realistic assumptions about channel knowledge and the underlying 
channel model can significantly mitigate these gains. 


3.2.2 A Gaussian Approximation to the MIMO Capacity 


Asymptotic capacity analysis of MIMO channels with arbitrary large number of antennas is widely 
used to overcome mathematical complexity and often allows for significant insight. Several works 
69 71 investigate the accuracy of a Gaussian approximation by deriving the variance and mean of 


the capacity of a MIMO system resulting with the fact that, distribution of the single-user-MIMO 
capacity follows the Gaussian distribution with mean that grows linearly with min(r, t), and variance 
which is mainly in influenced by the power constraint P. 

According to this, in this work each user capacity is approximated by a Gaussian distribution, i.e., 
for each user * , G ~ Mm*, o»). 


3.3 Extreme Value Theory 

As the number of users raises (K —> oo ) the distribution function of the maximal capacity is degen¬ 
erating, therefore a need to obtain the limit distribution for asymptotic analysis is required. 
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Extreme Value Theory (EVT) is a common tool used to approximate such limit distributions, for 
large number of random variables. In this sub-section, we briefly review some EVT important re¬ 
sults that are used in the asymptotic analysis of the capacity gain( [72], 73 ). 

We first review results for i.i.d. sequences. We then elaborate on extremes results for stationary 
sequences which will be used when analysing the time dependent capacity sequence each user ex¬ 
perience. 


Theorem 1. ( ^72], fTffj/j. Let (Xi, ...,X n ) be a sequence of i.i.d. random variables with distribution 
F(x), and let M n = max (Xi,X n ). If there exists a sequence of normalizing constants a n > 0 
and b n such that as n —> oo, 

P{a n {M n - b n ) < x} -> G(x) (3.3.1) 

for some non-degenerate distribution G, then G belongs to the type of one of the following three 
distributions: 

Type I: G(x) = e~ e 
Type II: G(x) = j 

f e -(-*) 

Type III: G(x) = j ® 

The three types of distribution limits can be formulated to a generalized extreme value (GEV) 
distribution type 

G(z) = exp{—(1 + £(——(3.3.2) 

<7 

and we say that F(x) is in the domain of attraction of G, where £,/k, cr are the shape, location and 
scale parameters respectively. 


— OO < X < oo 

x < 0 

for some a > 0 x > 0 
for some a > 0 x < 0 


In order to determine the shape parameter f by the ancestor distribution F(x), let us look on 
h[x) the following reciprocal hazard function 


h(x) 


1 - F(x) 
f(x) 


for xf < x < x F , 


where xf '■= inf{x : F(x) > 0} and x F := sup{x : F{x) < 1} are the lower and upper endpoints 
of the ancestor distribution, respectively. Then the shape parameter £ is obtained as the following 
limit, 


d 

dx 


h{x) 



G 


We have considering convergence of probabilities of the form P{a n (M n — b n ) < x}, which may 
be rewritten as P{M n < u n } where u n = u n (x) = x/a n + b n . So the theorem from above can be 
formulated in a different way: 


Theorem 2. ( [7<% Theorem 1.5.1]) Let {V„} be an i.i.d. sequence. Let 0 < t < oo and suppose 
that u n is a sequence of real numbers such that 


n( 1 — F(u n )) —> x as n —>• oo. 


Then 


P{M n < u n } —> e T as n —)• oo. 


For the more interesting and relevant case of normal distribution, we have the following 
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Theorem 3. ( f72 , Theorem 1.5.3]) If {X n } are an i.i.d. standard normal sequence of random 
variables, then the asymptotic distribution of M n = max (A'i,..., X n ) is a Gumbel distribution. 
Specifically 

P{a n (M n - b n ) < x} -» exp{—e _K } (3.3.3) 

where 

a n = (21ogn) 1/2 (3.3.4) 

and 

b n = (21ogn) 1/2 - i(21ogn) _1/2 [loglogn + log47r]. (3.3.5) 

If {X n ) follows the Gaussian distribution with mean p and variance a 2 , then the above normalizing 
constants are 

a n = cr(21ogn) 1 / 2 (3.3.6) 


and 


bn. — 1 


(21ogn) 1 / 2 — -(21og?r) 1 ^ 2 [log log n 


log 47 t] 


+ n- 


(3.3.7) 


Please note that if F(x) follows a Gaussian distribution we can see that T = e 


We turn now for results regarding maxima of stationary sequences. Dependence in stationary 
series can take many different forms, and it is impossible to develop a general characterization of the 
behavior of extremes unless some constraints are imposed) 73]). The following conditions makes 
precise the notion of extreme events being near-independent if they are sufficiently distant in time. 

Definition 1. We say {X n } is strongly mixing if there is a function g on the positive integers with 
g(k) —> 0 as k — > oo such that, if A £ 5(ATi,..., X m ) and B £ S(X m +k, X m +K+i, ■■■) for some 
k,m> 1, then | P(4fl B ) — P(A)P(B) \< g{k). Where $(•) denotes the a-field generated by the 
indicated random variables. Strong mixing was first introduced by Rosenblatt (1956). 

In weakening the strong mixing condition, one notes that the events of interest in extreme value 
theory are typically those of the form {X n < u}. Hence one may be led to propose a condition like 
mixing but only required to hold for events of this type. 

Definition 2. ( \7$f , \56f ) A stationary series X\,X 2 ,... is said to satisfy the D(u n ) condition if, 
for all i\ < ... < i p < j\ < ... < j q with j-\ — i p > l, 


P 7 j Xj i ft u n ,...,Xj ip u n ,Xj 1 ^ u n ,.. ■, Xj t/ V u n { 

P r {Xj /1 Si 'U'm ■••Xi p S O n { P r {Xj, S Uni ■ • •: ^j q — ^n}| ^ CX.(ri, /), 


where a(n, l) 0 for some sequence l n such that l n /n —> 0 as n —¥ oo. 


If u n is a sequence that increases with n the long range dependence condition D(u n ) ensures 
that, for sets of variables that are far enough apart, the difference of probabilities expressed above, 
while not zero, is sufficiently close to zero to have no effect on the limit laws for extremes. 

Another condition that is very relevant to us is the local dependence condition D'{u n 


74 


Definition 3. 


[n/k\ 

D'(u n ) : limsupn P{X i > u n ,Xj > u n } —>■ 0 as k — > oo 

n— voo 0 

J=2 

This condition bounds the probability of more than one exceedance among Xi, ..., X\ n /k \ ■ This 
will eventually ensure that there are no multiple points in the points process of exceedances which 
is necessary in obtaining a simple poisson limit for this point process as will be discussed later. A 
result regarding to the two conditions D{u n ),D'(u n ) is: 
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Theorem 4. ( f?H Theorem 3-4-1] ) Let u n be constants such that D(u n ), D'(u n ) hold for the 
stationary sequence {A„}. Let 0 < x < oo. Then 

P{M n < u n } —> e _T if and only if n( 1 — F(u n )) —> x. 

Thus, from the above we can conclude that the particular GEV type which applies is the same 
as if the sequence {X n } were i.i.d. with the same marginal distribution function F, and the same 
normalizing constants may be used. 


3.4 Point Process Approximation 

As mentioned earlier the expected channel capacity was derived in a number of method, one of them 
was done while using the point process approximation. PPA is a tool for analysing threshold arrival 
rates and tail distributions. This analysis imposes threshold-based algorithm which discussed earlier 
as an option to achieve the maximum channel capacity. Again we will start with the basic theory 
which concern an i.i.d. sequences and elaborate on stationary process. 


A point process on a set A is a stochastic rule for the occurrence and position of point events 


( 73 ). Our characterization for the statistical properties of the point process is to define a set of 
non-negative integer-valued random variables, N(A), for each A C A, such that N(A) is the number 
of points in the set A which can represent a period of time The expected number of points 

in any subset A C A, is defined to be the intensity measure A(A) of the process. The number of 
points in a given interval follows a Poisson distribution with intensity measure A(A) and intensity 
density A (t,x) where: 

A(A) = [ X(t,x)dA 


We assume A'i, X 2 , ..., X n is a sequence of i.i.d. random variables, with common distribution func¬ 
tion F{x). We suppose that F{x) is in the domain of attraction of some GEV distribution G(-), 
with normalizing constants a n and b n . We then define a sequence of point processes N\, N 2 , ... on 
[0,1] x R 2 by: 

N n = { 1 -,-= (3.4.1) 

In a n J 

We can notice that the numbers of occurrences counted in disjoint intervals are independent from 
each other, law value points are normalised to the same value bi with: 


k 


lim 

n —> 00 


Xf — b„ 

&n 


and high value points are retained in the limit process. Under all the above we get: 

Theorem 5. ( jTf, The sequence N n on the set [0,1] x (bi + e, 00 ), where e > 0, converge to 
a non-homogeneous Poisson process N, that is: 


N n —> N as n —» 00 , 


with intensity: 

A (t,x) = (1 -Kx)+ 1-1/? 

We are interesting in the sets of the form: B u = [0,1] x (u, 00 ) where u > bi which reflect the 
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users with a capacity that exceeded a threshold u. In this case the intensity measure is: 


A {B u ) = A([0,1] x (u, oo)) 

pi pOO 

= / A (t,x)dxdt 


/ 1—0 J X—U 

/■l 


ft =0 


- (i + £*)+ 1/f 

pi 

/ (l + ^u^^dt 

Jt =0 


dt 


— (i + £ w )+ 


-m 


where a+ denotes max{0,a}. The occurrences of capacity exceedances of the threshold u can be 
modeled by a Poisson process, with parameter A (B u ). In case the sequence {X n } are standard 
normal random variables: 


(i + £u) + m 



X — U 


Since we are interesting in time dependent environment we would like to explore the point 
process analysis for stationary sequences. In 176 we can find the point process of exceedances of a 
high level u n by a stationary sequence {Xi} (i.e., points where X, > u n ), obtaining Poisson limits 
under quite weak dependence restrictions. These involve a long range dependence condition D(u n ) 
and a local dependence condition D'{u n ) as defined in definitions [2] and [3] 

Theorem 6. ( f76f ) Let D(u n ),D'(u n ) hold (u n satisfying u n = u n {x), such that 1 — Fi(u n ) = 
P{X i > u n } ~ x/n as n —► oo for all t > 0 ) for the stationary sequence {X,}. Let N n be the point 

process, consisting of the exceedances of u n ( t). Then N n N as n oo, where N is a Poisson 
process with parameter x. 


As a result of the above theorem, the point process under the dependence consideration behaves 
just like one obtained from an i.i.d. sequence with the same marginal d.f. F. If the local condition 
is weakened or omitted, then clustering of exceedances may occur. 


3.5 Chain Dependent Process 


In this work we use a specific model for dependent data referred to as chain dependent model. We 
will review the formulation for this model and some relevant results. 

Let {J n } be a finite state space Markov chain with ergodic probability transition matrix P = 
( Pij), 1 < i,j < m. Further let {X n } be a chain dependent sequence such that 


L*[Jn j: Xn ^ X | Xq , Jq, X n —\, J n —\ — z) 

P{Ln = j-> X n T x | J n ~ i = i) = pijHi{x) 


(3.5.1) 


where Hi, i = 1 are the underlying non-degenerate and honest (Hi(+ oo) = 1) distributions, 

associated with each state of the chain. 

Immediate consequences 


50 


i The stationary probabilities associated with P are (7Ti, ...,7r m ). 


ii P(X n < x | J n - 1 = i) = Hi (x). 

iii P(X i < xi, ...,X n < x n | J 0 , Ji,..., J n -i =i) - JlLi p ( x i < \ J l - 1 ) 

The random variables {X n } are conditionally independent given the chain in precisely the 
sense given by (iii). 
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Define a nonnegative m x m matrix by Q{x) = ( PijHi(x )) and the random variable M n = 
max(Ai, ...,X n ), then In [50 we can find the distribution of the maximum given by 

P(J n = j, M n < x | Jo = i) = Q?j{x) (3.5.2) 

where Q n denotes nth matrix power of Q. The matrix Q with suitable normalizing constants 
dijn > 0 and bij n , i,j = 1, ..,m, n ^ 1, convergence to non-degenerate mass matrix as n —> oo 

{Qij^O'ijnX T bij n )( t { L)-,' (^J } ^ ^ OO. 

The Perron-Frobenius eigenvalue pu{x), of {Uij(x)}, is an extreme value distribution. To attain 
this result the Semi-Markov matrices theory was used. 


A very important results given in 53 states that that the limiting behaviour of M n is not 


dependent on the initial distribution of the underlying Markov chain. Thus it may be started with 
the stationary distribution (J 0 =7 r,, i = 1, in which case {X n } is stationary. In addition it 

is stated that every stationary chain-dependent process {X n } is strongly mixing (Lemma 1.1, |53j). 
Then under any initial distribution for Jq (theorem 3.1, [53 ) we have 


P(M n < d n ) —> l <t=> H n (d n ) —¥l as n 


(3.5.3) 


where d n is a sequence of reals, 0 < l < 1 and H{x) is a distribution function such that H(x) = 

TZi^ix). 


The dependency of the sequence {X n } can be characterized such that it would be easier to attain 
the limiting distribution of M n . Such dependency conditions are the D(u n ) and D'(u n ) as given 
in definitions [ 2 ] and [ 3 ] In 55 a similar analysis was performed for this purpose. They defined a 


class of sequences, were the chain dependent process is a sub case of it, that fulfill some properties 
which under them the extreme laws could be achieved. Among those properties, the ones that are 
relevant to us are that the D{u n ) and D*(u n ) (an extension for D'(u n )) conditions hold and that 
there exists a sequence of real numbers {u n } and real numbers Zi, i = 1 , ..., to , such that for every 
Hi, i = 1 , m, as in (ii) we have 


1 - Hi(u n ) = — + o ( - 


as n 


(3.5.4) 


Under the conditions (la) to (le) in [55] their basic result is 

Theorem 7. ( J5$j ) Let { X n }, {u n } be sequences of random variables and real numbers respectively, 
satisfying the conditions (la) to (le) given. Then 


Jim P(M n < u n ) = e{~y^pj Zj} 


(3.5.5) 


where pi, i = 1, are the stationary distribution on the chain and Zi are as given in (3.5.4). 


In addition for the above theorem they give a corollary regarding the properties for convergence 
to the specific extreme value distributions. However assumption (2.2) in their work does not hold 
for this study marginal distributions, which associated with the states of the chain. Therefore their 
specific rules for convergence does not apply here exactly. 
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Chapter 4 

Model Description 


We consider a slotted MIMO uplink system with K independent users and one base station. The 
time axis is divided into fixed-length intervals referred to as time slots or, simply, slots, in the sequel. 
The distribution of the channel capacity a single user experiences, is time varying according to a 
Gilbert Elliott model 42 . We first assume that a single user, with the strongest received signal at 


the base station, is selected by a centralized scheduler to utilized the channel and transmit in each 
slot. We then consider the distributed setting as well. Hence, we first wish to analyze the expected 
channel capacity where the base station chooses the strongest user for transmission using the CSI 
sent to it from the users. We then turn to analyzing the capacity under a distributed opportunistic 
user selection algorithm, which select the strongest user to transmit, without coordinating between 
the users and without all users sending CSI to the base station. We assume the transmitters 
approximate their channel parameters from a pilot signal sent from the receiver. 

The linear model of the system is: 


y(n)=H(n)x(n)+N(n) (4.0.1) 

y £ C r is the received vector from the input vector x £ C r where H is a r x t complex matrix (r 
- number of receiving antennas, t - number of transmitting antennas) and N is zero mean i.i.d. 
complex Gaussian noise with independent, equal variance real and imaginary parts. The transmitter 
is constrained in its total power to P, 

E[ x^x] < P. 

The matrix H(n) is a random matrix which models the Rayleigh fading environment of the time 
dependent channel. 


4.1 Time dependent channel 


the channel distribution each user sees is governed by a Markov chain with two states, G (for Good) 
and B (for bad), where each state determines the channel distribution (Figure 4.1). Thus, the 
channel is memoryless conditioned on the state process, and the capacity of the i-th user in each 
slot is determined by the Good Bad Markov process {J(n)}: 


N g when Ji(n) = Good 
Nb when Ji(n) = Bad , 


(4.1.1) 


where N g and N h are normal random variables with parameters (y g ,o g ) and (/q,,cq,), respectively. 
This is due to the Gaussian approximation for the MIMO channel capacity 170,71 . The param¬ 
eters reflect the differences in the channel qualities, that is, good channel parameters maintain 
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Figure 4.1: A Good Bad channel model according to 42 . The capacity distribution is either good 


or bad Gaussian random variable according to the state of the system. 


(1) t T g > a bl € 1R or (2) a g = a b and /x g > fj, b . 

As mentioned, we assume a non trivial arrival process and that the users are not always back- 
logged. The arrival process of new packages to the system is characterized by a Poisson process. 
In addition, we assume that all user are homogenous, thus, all users has the same arrival rate A. 
We present several approximate models. Thus, new packages may enter the system at any given 
(continuous) point on the time axis or at the beginning of a slot, depending on the approximate 
model. The size of the packages is set such that the transmission time is exactly one slot. We 
define the service time as the time from the moment a package becomes first in queue, until it is 
successfully transmitted. Hence, the service time of the packages is composed of the waiting time 
for transmission and the transmission time. In addition, we assume that services are synchronized 
to (i.e., can only start and end at) slot boundaries. This of course may cause collisions. 
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Chapter 5 

Capacity under time dependent 
channel 


In order to exploit user diversity, the user with the best channel mustjrtilizes the channel. Therefore, 
the problem is in finding the distribution of the random variable Mx , defined as: 

M k = max{Ci(n), C 2 (n ),..., C K (n)}, (5.0.1) 


where Ci{n) is defined in (4.1.1). We are interested in the expected channel capacity E[M K \. 

In the beginning of each slot the channel each user experience depends on the state he exists 
in, resulting with two independent groups of users, a good group and a bad group. The two groups 
are independent due to the independency between the users, and in each separate group the users 
identically distributed. The size of these groups is constantly changes according to each user state 
process which happen in the beginning of each slot. Given the state of all users the groups size are 
known and we may derive the expected channel capacity in the following way 


E[M k \ =E[max{C 1 (n),C 2 (n), ...,C K (n)}} 


K 


= '^2,E max{max{C'i(n),...,Cj(n)},max{G J -+i(n),...,C'*:(n)}} j P{j) 

3= 0 


(5.0.2) 


The distribution of the maximum for each group can be obtain by using EVT, although the compli¬ 
cated part is to do so for every group size. Unfortunately due to complicated analytical expressions 
the result remained open, and more details can be found in the appendices regarding this way of 
analysis. 


Nevertheless we were able to achieve a closed form for the channel capacity in more informative 
ways. The first one is by using the stationarity of the users’ capacities sequences, and the second is 
to analyse the rate of exceedance above some threshold while imposing our distributed algorithm. 


5.1 Capacity - Centralized Scheduling 


In this section we give the expected channel capacity with the assumption that the strongest user 


utilizes the channel. Let us consider the distribution of each Ci(n), as defined in (4.1.1), which 


is determined by the Good-Bad Markov process {The stationary distribution of the chain 


is 


( P a ) 

^ Q+/3 ’ ck+/3 ' ’ 


which we will denote as ( p,q ). Therefore, the distribution of C)(n) assuming a 
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stationary distribution is as follows: 

F(t) =P{Ci{n) <t) = 

P{N g {n) < t | J(n) = G)P{J{n) = G)+ 

P(N b (n) < t | J(n) = B)P(J{n) = B) 1 ' ' ’ 

= pFg(t) + qF b (t), 

where F g (t) and F b (t) are Gaussian distributions with parameters (n g ,a g ) and crj,) respectively 
which represent the users’ different channel states. The distributions parameters maintain a g > 
<7b, p g , Hb £ H or a g = a b and p g > p b - The distribution of the maximal value is in the form of 

P{Mr <x) = P(Ci(n) < x, C 2 {n) < x, ...,C n (n) < x) = F n {x), 

due to the independence between the users. We wish to test the behavior of F n (x) as n —> oo. 


The legitimacy to use the stationary distribution also comes from 53 as given in (3.5.3), the 
convergence of F n (d n ) to a real number l, where 0 < l < 1 and d n is a sequence of reals, exist if and 
only if, P{M n < d n ) —> l as n — >• oo. Thus we can analyze the expected channel capacity using EVT 
with the distribution above, as the marginal distribution function for each user. In order to do so, 
first we need to prove that convergence to one of the extreme distributions types G{x ) exists, and 
then find the constants a n and b n such that as n — > oo , P{Mk < u n } < x} — >• G(x). 72 1.6.1, 


1.6.2] gives necessary and sufficient conditions on the marginal distribution F to belong to each of the 
three possible domains of attraction of the extreme value distributions. The first sufficient condition 
states that if / has a negative derivative f for all x in some interval (xo, Xf), {xf < oo), f(x) = 0 
for x > Xf, and 


lim 


Pit) 


= -1 


(5.1.2) 


then F is in the domain of attraction of Type I extreme value distribution (Gurnbel distribution). 
The second necessary and sufficient condition for F to be in the domain of attraction of Type I 
extreme value distribution, states that there exists some strictly positive function g(t) such that 


lim 

t\x F 


1 - F(t + xg(t)) 
1 - F(t) 


(5.1.3) 


for all real x. 


Our main result in this context is the following. 

Theorem 8. Let (C\, ...,Ck) be a sequence of random variables with distribution F(t) as given in 
(5.1.1), then the asymptotic distribution of Mk = max (C ±,..., Ck) is of Type I. Specifically, 


where 


and 


P{aK{M K — bn) < x} ^ e 
\/2 log K 


a K = 


b K = a g i yj2 log A' - 


log log K + log fj' 

2 % /2bpT 




(5.1.4) 

(5.1.5) 

(5.1.6) 


The proof outline is as follows. First we start by solving directly the limit in condition (5.1.2), 
while using the sandwich rule, replacing the Gaussian CDF with its corresponding complimentary 
error function and using the upper and lower bounds of it. The second condition (5.1.3) has been 
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proven by finding that g(t ) = a' 2 /{t — p g ), along with the asymptotic analysis that the ”Bad“ Gaus¬ 
sian distribution has little affect in it’s tail comparing to the ’’Good” distribution. And finally using 
similar asymptotic analysis as preformed in condition (5.1.3) the normalizing constants was derived. 
For more details, see appendix [Aj which gives the fulfillment of the two conditions above and the 
derivation of the normalizing constants ax and bx- 


Using Theorem [ 8 ] we can now give the expected channel capacity 


E[M k \ = b K H- 

ax 


\J 2 log K — 


log log K + log p 
2^2 log K 


V21og K 


(5.1.7) 


Tg 


where 7 = 0.57721 is Euler-Mascheroni constant. Thus, for K sufficiently large, 

E\Mx\=a g V(2\ogK)+Vg+o(-^=^ . (5.1.8) 

That is, for large number of users, the expectation capacity grows like a g \J (2 log K). A word is 
in place here concerning the values of p and q. We assume that (p, q) are bounded away from zero, 
which implies that the uninteresting case which all users has bad channel state, or the opposite 
case where all have good channel does not apply. Also, in the proof of Theorem [ 8 ] we used this 
assumption while we made a division by p and q. Simulation results for the capacity distribution is 
given in Figure pTI] 


f(c) 



Figure 5.1: Simulation for the maximal capacity distribution, when choosing the maximal capacity 
among 5000 capacities that following the stationary distribution where the red line is the corre¬ 
sponding Gumbel density with the constants ax and bx- 


Remark 1 (The probability p). The probability p, which is the stationary probability to exist in a 
good state, rules the average number of users which are in good state while considering the whole 
system. In each time slot we can distinguish between two groups of users, the users that are in good 
state and the users that are in bad state. It is easy to show that the number of good users in the 
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good group on average is pK. Hence, as p grows the expected capacity grows since there are more 


users in good state, we can see it in the analytical result (5.1.7) for the channel capacity and in 
Figure \ 5. 


C[bps/Hz] 



Figure 5.2: The channel capacity gain as given in (5.1.7) in comparison to the 
from the good group with size Kp , for 5000 users as a function of p and p g = 


gain of choosing only 
V2, a g = 0.5. 


One may wonder on why considering the bad group at all, meaning, why should a user which 
belongs to the bad group will be taken into account in the scheduling decision process in the 
beginning of a slot. Thus leaving only the users with the good channel to compete for the channel. 
For this case, the capacity with K sufficiently large is 

E[M pK \ - <W(21ogpiC) +/h? + ° ( ^ lo g p jf ) (5- 1 - 9 ) 

In Figure [5T2] we can see the influence of the bad group on the capacity gain. For rather small 
values of p it is beneficial to schedule the strongest user from both groups. As p grows the size of 
the good group grows as well hence the two capacity gains converge to the case were all the users 
are in good state. In Figure [53] we can see the capacity gain as a function of the number of users, 
specifically, in Figure [5.3b|the difference between the gains is noticeable. 


5.2 Capacity - Distributed Scheduling 


When considering a distributed scheduling algorithm, we derive the expected channel capacity using 
PPA. PPA is a tool for analysing threshold arrival rates and tail distributions, while imposing a 
threshold-based algorithm for the channel access, which was discussed earlier as an option for a 
distributed scheduling mechanism. This method can also be found in 77 for the case of i.i.d. and 


heterogeneous users. First we analyze each user separately and examining its sequence of channel 
capacities over time. This sequence can be modeled as a point process on some interval, using the 


definition in (3.4.1), yet with the different that our process of points is formed from a dependent 


sequence of random variables. As it turns out, the exceeding points above some threshold u con¬ 
verges to a non-homogeneous Poisson process, which we discuss later, and the question that can 
be asked is, what is the rate of exceedance above the threshold u. We show that our dependent 
sequence converge to the same Poisson process if the sequence was of an i.i.d. random variables, 
which will eventually allow us to examine the sequence of all users’ capacities in a specific time. 


The sequence {Ci(n)} depends on what state user i exists in, which is, in turn, ruled by the 
underlying Markov chain process {Ji(n)} as defined in ( |4.1.1 ). This kind of process have been stud¬ 
ied before and is known as a J — X process by Janssen (1969, 49 ), or “chain-dependent” process, 
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C[bps/Hz] C[bps/Hz] 




Figure 5.3: The capacity gain for choosing the best user among all the population and only the 
good where a g = 0.5 and (a) p=0.5 (b) p=0.2 


emphasizing the fact that this is a natural extension of an i.i.d. process. The extreme laws for 
chain dependent processes have showed that, convergence to one of the extreme distributions exist 
under some conditions. This is important for the formation of the point process and the analysis 
of the rate of exceedance we do in the sequel. 


Before we give the formulation for the chain dependent process in our work, let us remind the 


formulation of the original J — X process as reviewed also in Chapter 
satisfies: 


78 , 53 


3.5 


The J — X process as 


P(Jn — j i X n ^ Oi | Jo , X\ , J\ , ..., X n _ — i — z) 

— P{Jn — ji X n ^ a | J n —\ — — PijHi{OL ), 


(5.2.1) 


where P is the transition matrix of the chain and Hi(a ), where i belongs to the state space, are the 
distribution functions associated with the chain states, respectively. Note that each state determines 
the distribution of A' for the preceding time transition. This means that given the chain process 
{ J n } The random variables of the {X„} process are conditionally independent. More precisely, 


P{X\ < «i, AT 2 < «2, ■■■■,X n < a n | 

JO = joi J\ = j 1) Jn— 1 = jn— l) 

= P( X\ < a.\ | Jo = jo)F(X 2 < «2 | Ji = ji) ■ • ■ 

^ | Jn— 1 = jn— l) 

= HMHM ■ ■ ■ Hj ri _ 1 (a n ). 


As discussed in Chapter 3.5 every stationary chain-dependent process {X n } is strongly mixing. 
This is true if the initial distribution of the chain is the stationary distribution, i.e. P(Jq = i) = tt , 
for all i in the finite state space, where 7r is the stationary distribution. Then, the distribution 
of X n is H(x ) = YiTtiHijx). This result is very important to us since we are interested in a time 
dependent environment and we would like to explore the point process analysis for the stationary 
sequences associated with it. 


Returning to our context, we shall consider the sequence {X n } as the sequence of capacities 
{Ci(n)} of the i-th’ user over time and the chain process {</„} as the sequence of the irreducible, 
aperiodic, 2-state Good-Bad Markov chain { Ji{n)}. The distributions Hi(a) will be the distributions 
Hi(a ), where i = g,b, according to the Good-Bad sates, respectively. Since we are analysing the 
capacity process of one user, which is identically to the other users, we will note {C n } as it’s capacity 
sequence and {J„} as the sequence of the chain. 
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Unlike the definition of the J — X process, in our model each state determines the capacity 


distribution in the current slot. So equation (5.2.1) according to our case can be written: 


P ( J n. jj C n £ Oi | ./0, C\ Cn— 1 5 Jn— 1 — 7 ) 

U( J n j, ^ Oi | 2) P^jHj(cx') 


(5.2.2) 


The fact that the capacity in each slot depends on the chain state in the current slot doesn’t impact 
the strong mixing property of the sequence {X n }. We will show that the J — C process as defined 
to meet our model, is still strongly mixing by definition |TJ while using the proof guidelines in 78 


Lemma 1. { C n } is strongly mixing with g{k) = JU 7q I P tj ~ n j I; where it is the stationary 
distribution of the chain and P = ( P k )ij ■ 

Proof. Let A and B be as definition [T] Then 

| P(Ar\B) - P(A)P(B) I 

< ^2 I P(AnB,J m = i,J m+k =j)- 
ije {o,i} 

P(A, J m = i)P(B , J m+k = j) | 

= p ( A \ J m = i)P(B\J m +k = j)P(J m = i) 

ijefo.i} 

| P(Jm+k = j\Jm — l') P(Jm+k ~ j) \ 

< E ~ 1 Dfc 

ij'efo,!} 


7 T'i I Pij — 7Tj |= g(k). 


Ph — 7 Tj |—>■ 0 as k 


For each i S 0,1, JT 7 r : 
mixing property, nevertheless, it is easy to notice that in fact as k 

i. 


oo, this was also used in 78 for showing the strongly 
oo, ( P k )ij —> 7 Tj regardless to 

□ 


Considering Theorem [ 6 j we will show that the point process under the chain dependent model 
convergence to a Poisson process just like one obtained from an i.i.d. sequence with the same 
marginal d.f. F, which in our case is H{x). The normalizing constants will be as given in Theorem 
[ 8 ]which maintains the condition 1 — H(u n ) = P{X > u n } ~ T /n asn-> oo. As mentioned before, 
the strong mixing condition hold for our sequence of channel capacity, and as a result so does the 
condition D(u n ) as defined in definition [2] which is a weakened case of strong mixing [56 . We 


would like to show that condition D'(u n ) as defined in definition [3] also holds so we would be able 
to characterize the rate of exceedance over the threshold u n . Note that we are only interested in a 
sequence of reals {u n } which satisfies 1 — H(u n ) = x/n + o(l/n), in the fulfillment of the D'(u n ) 
condition. 


Lemma 2. The local dependence condition D'(u n ) holds for the sequence {C n } as defined in (4.1.1). 
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Proof. 


[n/fcj 


limsupn P(C”i > u n ,C r > u n ) 


(a) 


Ln/fcJ 


< lim sup E E P(Ci > u n \ Jr = i) 

n ^°° r=2 i,je{0,l} 

• P{C r > U n \J r = j)P(Jl = i)P{J r = j\Jl = i) 

\n/k\ 

= limsupn^ ^ (1 - Hi(u n ))(l - Hj{u n ))'KiPZ j 

n—>-oo 0 . . . 

r=2 i,je{ 0,1} 

M, L ^ J v- ( r /I s 

< limsup n > >-ho - 

n->-oo ^ ' V n7r i V 77 

^ r=2 ij e { 0,1} V 2 V 


T 

-h 

nTTj 

[n/k\ 


o I - II t nPTj 


< lim sup n E E 


n—too 0 ^ TTj-TTi \7l 

r=2 »j€{0,1} ^ X 


- +0 ( - 
n 


t nPl 


< limsupn y f-+of-E 7^ max{P r } 

™->°° LfcJ Vn W/ iJ ^ 1} 7r i 7r * 

= (t 2 + o(l)) 2 ^ V TTi max{P r } —> 0 as k —» oo 

^ 7 T.-'TT: T J 


*.JG{0,1} J 

In the above chain, (a) is since once J\ is known, C\ and C r ,J r are independent, then we 
conditioned on J r . (b) is true since u n maintains n(l — Fi(u n )) = t + o( 1) so we have 


1 - Fi(u n ) = 1 - H(u n ) = 1 - E TTi Hi(u n ) 

i 

E 7Tj(l - Hi(u n )) = - 1 1 

n 


al¬ 


and therefore 


7 Tl 


(1 - Hi(u n )) = - - 7 Tt(l - Hi{u n )) < - + O . 

n n \ n J 


Note that 717 and t are constants. 


□ 


Thus, in our paradigm a single users’ channel capacity process has the same laws of convergence 
as if the sequence {C n } was i.i.d. with marginal distribution H(x). Therefore, since the users are 
independent and each user sees the same marginal distribution H(x), we can analyze the point 
process of the sequence of all users’ capacities at a specific time (e.g. time slot), resulting in the 
basic case of i.i.d. random variables, as given in Theorem [5] Considering the above, we now turn 
to evaluate the expected channel capacity. 


5.2.1 Distributed Algorithm 


The distributed algorithm for the channel access is based on the algorithm suggested in 32 . Given 


the number of users, we set a capacity threshold u such that only a small fraction of the users 
will exceed it. At the beginning of each slot, each user estimates its capacity for that slot. If the 
capacity anticipated by the user is greater than the capacity threshold, it transmits in that slot. 
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Otherwise, it keeps silent. Choosing the appropriate threshold value is of great significance. A lower 
value means more users exceedances, therefore more collisions, while a high value means more idle 
slots. The optimal threshold value is set such that one user exceeded the threshold on average, and 
as a result, its transmission is successful. We treat this slot as a utilized slot. Hence the expected 
channel capacity has the form: 


Cav (a) = P r (utilized slot)P[C|C' > u] 


In order to calculate E[C\C > w], the expected capacity experienced by a user who passed u, it is 
important to understand that the average capacity is ruled by the values of points that exceeds the 


threshold, so one needs to evaluate the distance of the exceeding points from the threshold. 75 give 
the analytical tools to compute the tail distribution of the exceeding points. These points follow 
the generalized Pareto distribution, hence by using the PPA exceedance rate results, and since we 
showed that we have the same exceedance as an i.i.d. case with H(x) as the marginal distribution, 
the result is the same and we have 

E[C\C > u] = u + — + o( —), 
clr ax 

where ax is the normalizing constant as in Theorem [8j 

We say that a slot is utilized if only one point out of all K points exceeds the threshold. Hence, 
as I\ —> co 


k[ t< 


l -T< 


K -1 


where the threshold u was chosen such that 1 — H(u) = 1/K. We will elaborate on this value in 
the next subsection. The expected channel capacity thus 


C av {u)=e 1 [u+ — +o( — 
a K \a K 


(5.2.3) 


In the next subsection we show that u = bx so when comparing the capacity result in (5.2.3) to 


the expected capacity E[Mx] as given in (5.1.7), for the centralized approach, we see that we have 
the same scaling laws, only that the distributed approach is smaller only by a factor of e” 1 . The 
normalizing constant ax along with the threshold u = bx is as given in Theorem [8j reflect the time 
dependency of the channel as explained before. 


5.2.2 Threshold Estimation 


The optimal threshold was set such that only one user on average exceeds it in each time slot. This 
is due to the analysis of 31 , where it has been shown that the optimal transmission probability for 
each user to transmit in a specific time slot is a(n)/n, where asymptotically a(n ) —► 1 hence, the 
probability 1/n is a good approximation. By using this rule we can estimate the optimal threshold 
value for the time dependent channel as well: 


1 - H(u) 


1 

K 


Hence, 

1 - P F g( u ) - qF b (u) = —, 

where the above represent the probability that a user capacity will exceed the threshold u. With 
similar derivation as in appendix [A] we get that 


u 


KT—r,(, h»s? + h°si °sK 

=«»v'2 WK ^1-- 


(log K 2 


Eg 


=b 


K 


(5.2.4) 


26 











5.2. Capacity - Distributed Scheduling 


Chapter 5. Capacity under time dependent channel 


c 

( 


3.5 

3.5 




o 

3.0 

3.0 


2Sr- 

-=- 2.5 


2.0 

2.0 


1.5 

1.5 

; : '• ..v .. •'.. • • ■ ■; a• *■ • > 

1.0 

1.0 

• 7 '■ "• . / \ V. •. •• ; 

0.5 

0.5 

:v ..... A. 




0.0 

0.2 0.4 0.6 0.8 1.0 n 0.0 0.2 0.4 0.6 0.8 1.0 


(a) 

(b) 

c 

c 

3.5 

3. 

o 

5 [ o 


3.0 f : : : 3 .of . 



Figure 5.4: Point process simulation for the exceedance of the threshold u = bn in a time dependent 
environment where(a) K=100 (b) K=1000 (c) K=5000 (d) K=10000 


Figure [574] gives simulation results for the users’ capacities point process, and the exceedance above 
the threshold u as given in (5.2.4). One can see that as the population grows, the threshold value 


also grows in a way which maintains the average probability of exceedance l/K. We can see that in 
Sub-Figure |5. 4a no exceedance occur among all users where in the other cases one user did manage 
to exceed alone. 


5.2.3 Refinement On The Threshold Value 

While considering i.i.d. random variables as the sequence of channel capacity, we can give a more 
accurate value for the threshold such that only one user exceeds it on average. The threshold given 


in the previous chapter (5.2.41, such that u n = bx, was formed due to the calculations of the mixed 


Gaussian distribution (5.1.1) which was the marginal distribution for a specific channel sample. As 
a reminder we are searching u n such that 


1 - F(u n ) = 


1 

K 


where in this chapter F is CDF of normal distribution with parameters (/x, cr), which has a known 
complementary function in contrast to the mixed Gaussian distribution. 

In 38 a use in the complimentary error function was done in order find threshold Uk such that k 


users will exceed it among the K users, in our case k = 1, hence we have 


u 1 


= M + V2crerfc 1 


= H +a\ 2\og(K) ~ log 


-2 tt ( 2 log ( - 


log 27 t 


(5.2.5) 


+ o ( K 2 ) . 
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Chapter 6 

Performance analysis - distributed 
algorithm 


In the previous chapter, we concerned ourselves with the question of what is the expected channel 
capacity in a distributed scheduling system based on the exceedance of some threshold uk■ This 
analysis touched just one part of the information communication chain, as we can ask relevant 
questions regarding delay and quality of service of the information, before it leaves the transmitter. 

K users wish to transmit data to a shared access point. Users are not necessarily fully backlogged, 
and each of them has a Poisson arrival process with rate A. Clearly this means a user may not have 
any information to send in a certain slot. We focus our attention on the users’ queues, as illustrated 
in Figure [Cr] and, specifically, characterize the service process for each user. The users scheduling 



Figure 6.1: System model. K users access a common channel. Each user has a packet arrival 
process with rate A 

is based on the distributed algorithm, where a user checks if its channel capacity is above some 
threshold uk , and transmits for one slot only if it exceeds it. The threshold is chosen such that the 
probability of exceedance equals l/K. Assume for now that this probability is a constant variable. 

In the following, we investigate the behaviour of the system, starting with referring our queueing 
model to the known model of slotted aloha. We then review a known approximate model, which 
captures the system performance, and extend it to fit our model. Finally, we present two additional 
approaches, under different assumptions, to approximate this queuing system by the analogy to 
known types of queues. We do so by considering the process of threshold exceedance, which has 
Poisson properties and is asymptotically considered as a Poisson process. 

In the first two subsections, we assume the users experience a time independent channel. Therefore, 
the rate for exceedance is the same for all users. In the last subsection we consider time dependency, 
and the possibility that users are in different channel states, hence, we have two different exceedance 
rates (as there is only value of the threshold). 
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6.1 Approximation by System and Users’ State 

It is not hard to see the resemblance of our model to the well explored slotted aloha system. Yet, 
the main difference is that in slotted aloha, the transmission scheme used is “immediate first trans¬ 
mission” i.e., if user i has an empty queue when a package arrives, it may transmit the package 
instantaneously, while in the case of collision, it may transmit again with a retransmission proba¬ 
bility pi. Our model follows a “delayed first transmission” scheme, where channel transmission can 
only happen with probability Pi (i.e., the threshold exceedance probability). When considering such 
a system consisting, of K queues, each with a memoryless arrival process yet a common server (i.e., 
the communication channel), the interdependence between the queues is very strong and cannot be 
ignored. It is reflected, among others, in the users’ queue length, probabilities for successful trans¬ 
mission or collision and the delay of packages. Several works in the literature explored the behavior 
of such systems, with the same terms of QoS. Due to the interdependence between the queues, 
each of these works considered a different simplified mathematical model, which resulted with an 
approximation for the required metrics. We elaborate on one specific work [3], which introduced a 
relevant approximate model. Although this work introduced a general model for both “immediate” 
and “delayed” first transmission, the analytical derivations and numerical results refers only to the 
“immediate first transmission” scheme, and not the “delayed first transmission” scheme we are in¬ 
terested in. We will briefly review this model and emphasize the differences in the mathematical 
derivations due to the different schemes. 


6.1.1 Review on the approximate model 


The model presented in 3! assumed that the arrival rate at user i is A i 1 and the arrival processes 
are statistically independent between the users. Time is slotted and it takes exactly one slot to 
transmit one packet. Thus, A i is the probability of arrival for user i in any given slot. User i 
attempts, with probability pi, to transmit the head-of-the-line packet in the queue (if the latter is 
nonempty) through a common channel to the receiver. 

At the beginning of a given slot the users’ status may be one of the three categories: idle , active 
or blocked. A user is idle if there are no packets in its queue at the end of the preceding slot. It 
is blocked if its queue is not empty and the latest attempted transmission was unsuccessful. It is 
active if its queue is not empty but its most recent attempted transmission was successful. We then 
let 


Pi = 


ri\i if i is idle, 

Si if i is active, 

qi if i is blocked. 


( 6 . 1 . 1 ) 


As can be seen, the setting of the problem is similar to the one considered in this study when 
Vi = Si = qi = j£. As mentioned, in jSj, the mathematical analysis and numerical results considered 
the case where r, = Si = 1, q t = Pi, which is the original slotted aloha paradigm. Specifically, 
the model consist of a coupled Markov chains, the system-status chain and the queue-length chain. 
The transition probabilities of each chain, as will be presented later, depend on the steady-state 
probabilities of the other chain and thus, all state equations for both chains must be solved simul¬ 
taneously. The system-status chain tries to capture the state of each user in any given time, hence, 
the status variable S consists of K ternary variables, Si, S 2 ,Sk, each of which indicates the 
status of the corresponding terminal. Namely, 


Si = 


0 if i is idle, 

1 if i is active, 

2 if i is blocked. 


( 6 . 1 . 2 ) 


The total number of states achievable by this vector is given by 


K 

E 

2=0 


K 


K -1 

*E 

2=0 


K- 1 


K 


= E 


i=0 


K 


i = 2 


K-l 


(K + 2) 


(6.1.3) 
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where i indicates the number of blocked users, along with the observation which no more than one 
active user may be present in the system in any given time. The new calculations of the transition 
probabilities is according to the “delayed first transmission” scheme, which are different from the 
one presented in |3j displayed in appendix [D] Two quantities which are needed for the calculation 
of the transition probabilities are: 


Pi( 1 | 1 ) = P r (queue size > 1 | user i is active) 
Pi( 0 | 2) = P r (queue size = 1 | user i is blocked). 


(6.1.4) 


Note that these are not the transition probabilities of a specific chain, rather, they reflect the 
coupling between the two chains. Along with these quantities, one can calculate P(S), the joint 
steady-state probability distribution of the random vector S. 

The queue-length Markov chain tracks both the status and queue length of each user, indepen¬ 
dently of the status and queue length of the other users. Specifically, the pair (T i: Ni) represents 
the state of the user and 7r(T), N t ) denotes its steady-state probability. N. t is the total number of 
packets at queue i and 

0 if z is blocked, 

1 if z is unblocked. 


Ti = 


The transition probabilities of the chain depend on the following average success probabilities: 


P B (i) = P r (success | user i is blocked) 
Pa(«) = P r (success | user i is active) 
Pi{i) = P r (success | user i is idle), 


(6.1.5) 


where the averaging is performed over the status of the other users (the status of the system). 


Namely, in order to calculate the probabilities (6.1.5) one needs the stationary distribution of the 


system-status chain P(S). This leads to the coupling of the two sets of equations. The calculations 
of this average success probabilities are also different from the one presented in |3 , and can be 
found in appendix [EJ The values which are in special interest to us, in order to help understand the 
system performance, are 

’’I 0 ’ 0 ) = ' „ A,Pj(Q 


41 , 0 ) = = 


A*Pa(*) + A iP B {i) 

A jP B (i) - \Pa{i) 
KP B (i) ~\{Pi(i) ~ P A (i)) 

4M) = =M°,o) 

Ai 


Gj(l) = T 


A,: A iP i (i) 


G*i(l) — Aj + A^=r 


A iP B {i) — A i(Pi{i) — Pa(i )) 
A iP B {i) — XiPA{i) 


( 6 . 1 . 6 ) 


(6.1.7) 


( 6 . 1 . 8 ) 


(6.1.9) 


( 6 . 1 . 10 ) 


'A i P B (t)-A i (P/(t)-P J 4(i)) 

where |6.1.9| and |6.1.10| are defined as the conditional moment generating function of the chain as 
follows 

OO 

G^z) 4 n(Ti,Ni)z N ', T t = 0,1 (6.1.11) 

Ni= 0 

and A = 1 — A,P = 1 — P. The expressions presented here are the same ones as presented in jsi], 
and also the mathematical development to attain them is the same. In order to find the steady 
state of the chains, as explained, the state equations must be solved simultaneously. The way 
to do so is by an iterative process, each time given the auxiliary quantities Pj(l | l),Pj(0 | 2) 
and P/(i), Pa(*), P B (i), as shown in Figure 6.2 The iterative method used is Wegstein’s iteration 


scheme, and after achieving satisfying convergence, one may calculate the performance metrics using 
the steady state of the chains. More details can be found in [3,65 . 
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Figure 6.2: Iterative algorithm for the solution of the coupled sets of equations for the state prob¬ 
abilities of the system chain and the queue length chains. 


The performance metrics 


The delay a package endures from the moment it is generated and arrives to a user’s queue until 
it is successfully transmitted, consists of three components, the time in line W q (i), the service time 
W s (i) and transmission time. Each of them is approximated separately and the sum of the three 
gives the total average delay. The following expression is the delay at a user i |3|. 

A = W q (i) + W s {i) + 1 (6.1.12) 


The 1 added represents the transmission time, which takes one slot, 
mated by 


= G 0 (l) 

1 — A,7t(1, 0) Pb(*)’ 


The service time is approxi- 
(6.1.13) 


where the service time, as defined, is the time a package exists in the head of the queue. In case 
a successful transmission happens in the immediate arrival of a package, the service time is zero. 
In any other case, meaning the user is in blocked state and has packages in his queue, we counts 
the number of slots until a successful transmission, i.e., a geometric random variable. Note that 
in |3], the service time was approximated differently, considering idle slots as zero service time. 
Clearly, there is no meaning to service time with an empty queue, therefore, service time should be 
normalized with the probability of not being in idle state with no arrival of a package. The time in 
line is calculated by using Little’s result, hence 


W q (i) 


Li 


(6.1.14) 


where Li is the average queue length of a user (without considering the blocked head-of-line packet 
as part of the queue), which is given by 


L = _ AfA t Pi(i) _ 

1 (KP B {i) - \iP A {i)){\P B {i) - a i(Pi(i) - P A m ■ 

The total weighted average system delay D is therefore 

y K i aa* 

D = ^‘=1 

EtiA i 


(6.1.15) 


where A = 1 — A, P = 1 — P. The probability for success given that the user is not idle, i.e., has 
package to send, and exceeds the threshold is given by 


Psuccid) 


P A (i){G\(l) - 7T(1,0)) + Pj(»Ml, 0) + P b {i)GI{ 1) 

Pi{ 1 - 7r(1.0)) 


(6.1.16) 
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where the numerator is the success probability regardless the users’ state, divided by the probability 
that exceedance occur with a non empty queue. 


Mean queue size A^=^(l-0.001) 

0 



■ Simulation 

■ Analytic 


Time in line Aj-=^ (1-0.001) 

t 



■ Simulation 

■ Analytic 



■ Simulation 

■ Analytic 


Figure 6.3: Anaclitic results and simulation for the system performance as a function of the number 
of users. Total arrival rate is At = -(1 — 0.001). (a) The mean queue size, (b) The time in line, (c) 
The service time, (d) The probability for success. The red line describes the anaclitic expressions 
16.1.15b, 16.1.14b, 16.1.13b, 16.1.16b respectively. 


6.1.2 Performance analysis and comparisons 


We present numerical results under “delayed first transmission” scheme. We divide the delay to its 
different components, i.e., the time in queue, service time, average queue size, system throughput 
and, in addition, the success probability p succ given that a user has a package to send and exceeded 
the threshold. Due to the fact that there is no exact solution for the slotted aloha system, a com¬ 
parison was made only with simulation results for this system. As explained earlier, the number 
of states in the system-status Markov chain grows exponentially with the number of users, there¬ 
fore, the simulation and the numerical results were compared for 2 to 10 users. The performance 
parameters were also checked as a function of the arrival rates. 

The simulation and analytic calculations were preformed under the settings of a symmetric 
system. The arrival process and the threshold exceedance process were approximated by a Bernoulli 


process, due to the discrete nature of the system. Figure [6f3| depicts the performance metrics as a 
function of the number of users, where the total arrival rate is close to maximum value which still 
allows the system to be stable. The results depict very good match between the simulations and 
the analytic results given in (6.1.13), (6.1.14), (6.1.15) and ( |6.1.16 ). 

Very important result is the fact that the probability for successful transmission, p SUC c = 1 —Pcoii, 


behaves exactly as predicted by the analytical model (Figure 6.3d), which means that it could 


be regarded as a constant average probability regardless of the strong interdependence between the 
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queues. This discovery is the foundation for the next two subsections, which, in contrast to the 
model considered thus far, are able to capture the system’s behavior for large number of users. 
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Figure 6.4: Anaclitic results and simulation for the system performance as a function of the number 
of users. Total arrival rate is A t = 0.15. (a) The mean queue size, (b) The time in line, (c) 
The service time, (d) The probability for success. The red line describes the anaclitic expressions 
(6.1.15), (6.1.14), (6.1.13), (6.1.16) respectively. 


Figure |6.5| and |6.6| depicts the performance as a function of the total arrival rate, which help 
us deduce a few interesting observations. First, for small and moderate values of the arrival rate 
one can see very good agreement between the analytic results and the simulation while near the 
maximum value, the instability region, the values start to diverge, especially in Figures |6.5a| and 


The second observation is that as the number of users grows, one can see even stronger compliance 
with the analytical results. This suggests that this approximate model may be suitable for a large 
population. Unfortunately, as mentioned, the calculation of the system steady state is intractable 
due to the exponential growth of the number of states with the number of users. In the following 
section, we give a simplified model, which utilizes the finding thus far and is able to approximate 
the performance for a large population as well. 


6.5b These results are consistent with the results of 3j, which used small values of A's as well. 
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■ Simulation 

■ Analytic 



(c) 


(d) 


Figure 6.5: Anaclitic results and simulation for the system performance as a function of the total 
arrival rate. The number of users K = 7. (a) The mean queue size, (b) The time in line, (c) 
The service time, (d) The probability for success. The red line describes the anaclitic expressions 
(6.1.15), (6.1.14), (6.1.13), (6.1.16) respectively. 


6.2 Approximation by Constant Collision Probability 


From the previous section it is clear that for large population, the performance metrics of the 
system are hard to calculate. In this subsection we will describe the service time using a different 
approach, which assumes that the probability of collision is constant and equals the average collision 
probability. The legitimacy comes from the results given in the previous section. Another work 
which used a constant collision probability as the key approximation is 79 . There, a similar model 


was considered, with n users competing on a shared medium, however the users were considered 
backlogged and only throughput performance was investigated. Nevertheless the results of the 
approximation in |79] show good agreement with the simulations, which strengthens its applicability. 
In 66 , independent M/M/1 queues were assumed, with a given distribution on the number of 


backlogged users. This results in an approximation with a varying collision probability. Herein, we 
use a different relaxation, assuming a constant collision probability. This allows us to easily decouple 
the queues. We start by investigating the exceedance process, by giving the terms for convergence 
to the Poisson process in the independent case of the channel, which connects directly to the service 
time. Then we turn to the service time analysis 


6.2.1 Threshold exceedance process 

The point process approximation as described in the preliminaries gives us convergence to a non- 
homogeneous Poisson process N of the points exceeding u n . Here we shall use the notation given 
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Figure 6.6: Anaclitic results and simulation for the system performance as a function of the total 
arrival rate. The number of users K = 2. (a) The mean queue size, (b) The time in line, (c) 
The service time, (d) The probability for success. The red line describes the anaclitic expressions 
(6.1.15), (6.1.14), (6.1.13), (6.1.16) respectively. 


in 72 for the Poisson properties of exceedance. 


Let Ci, C 2 , C n , a sequence of i.i.d. random variables with distribution F , be the channel capacity 


draws of a specific user. From 72[ Theorem 2.1.1], if u ri 
k = 0,1,2,..., 


satisfies n( 1 — F(u n )) —► x, then for 


P(N n < k) —► e 


k a 


s =0 


Si 


where N n is the number of exceedances of a level u n by {C n }. It is important to note that 
convergence in distribution happens if we change the “time scale” by a factor of n as defined in 


the definition of N n in (3.4.1). Meaning that we have a point process in the interval (0,1] and the 
exceedance of such points has a limiting Poisson distribution. This restriction may be problematic 
in terms of the data arrival rate which will also need to be changed in a factor of n for the analysis. 
However, under certain conditions, there may be convergence of the exceeding points for the entire 
positive line in place of the unit interval. In 72} Theorem 5.2.1 (**)], it is stated that if for each 
x > 0, exists a sequence {u n (x)} satisfying n(l — F(u n ( x))) = nP{C\ > u n (r)) ->Tasn->oo, and 
that D(u n (r)), Z)'(w n (x)) hold for all x > 0, then for any fixed x, N n converges in distribution to 
a Poisson process N on (0, 00 ) with parameter x. Clearly the dependence conditions hold for this 
case, since the sequence { C n } is i.i.d. so we only need to show that the first condition holds for 
each x. 


Lemma 3. Assume F is the Gaussian distribution and let a n and b n be given according to (3.3.6) 
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Poisson convergrnce 
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Figure 6.7: Simulation for 10000 user’s capacities which fol¬ 
lows an i.i.d. Gaussian distribution, showing the behaviour 
of exceedance which converge to a Poisson distribution (the 
dots) with parameter x. 


Figure 6.8: Values 
table of figure |6. 7 


and (3.3.7) respectively. Fix any x > 0 and set u n ( t) = logl ^ T + b n Then, 


lim n(l - F{u n (x))) = x. 

n—>oo 


Proof. In a similar way for the derivation of the normalizing constant in appendix [Aj and based on 
the derivation in 72 Theorem 1.5.3]. Let us find u n { t) which satisfies the equivalence condition 
for the convergence of the expression n(l — F(u n ( t))) 


n(l - F(u n (x))) 
nf(u n (x)) 
Un(^) 


as n —>• oo 

as n —> oo 


where the second line is true due to the Gaussian relation 1 — $(u) ~ as u — > oo, which in our 
case u n ( t) grows with n. So 


V2n 

— log V2tt — 


1 cq X U n (x) 

e 2 -» -— 


as n —> oo 


«n( T ) 


logT + log(u„(T)) - logn as 


we know that log(w n (x)) = ^(log2 + log logn) + o(l), hence 
v? (t) 11 1 

-V - =log-- log 47 t — - log log n + log n + o(l) 

z T z z 


“nW = 2 logn 1 + 


Unix) = y/2 log n ^1 + 

. , logl/x 
U n {x) = - \-b n 


log \ - \ log 4-7T - \ log logn 
logn 

log \ - \ log 4tt ^ log log n 


+ o 


2 log n 


logn 


logn 


where the penultimate line is due to Taylor expansion. 


o 


After proving that all conditions hold and we have a convergence to a Poisson process on the real 
line of the exceeding points, we can conclude that a user tries to transmit at a rate of x assuming he 
has packages to send. We will note this Poisson process of threshold exceedance as N exc . Note that 
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the time between each user exceedance is distributed exponentially with parameter t. The above 
leads us to a discussion on the duration of a slot. 

At the limit of large number of users, the capacity seen by a user who exceeds the threshold 
is high (in fact, it scales like 0(a\J 2 log A' + /it), see (5.1.8)). Hence, for any finite size of a data 
packet, transmission time tends to zero as the number of users increases. Furthermore, in this case, 
as convergence to Poisson process N exc exists, events duration also goes to zero. This motivates us 
to suggest the asymptotic model, where transmission time is negligible but still exists, hence under 
this assumptions we will refer the slots as mini slots to emphasize this fact. 


Remark 2 (Zero collisions). The users are independent, and each of them is also characterized with 
the same N exc process. As known, one of the properties of a Poisson process is that no two events 
can occur in the same time. Therefore, once a user exceeds u n , assuming it’s transmission time 
goes to zero, the probability that other users will exceed u n goes to zero as well. While considering 
this scenario, we can observe an approximation for the behavior of the user’s queue with Poison 
arrival process with rate A and exponential service process with parameter x, as the familiar M/M/1 
queue. This setting gives as an upper bound for the service time and other QoS properties. Still this 
scenario is treated only as a guide line for the analysis, since we are assuming that the services are 
considered to be synchronized, and hence collisions may occur and result with decline in the rate of 
service. 


6.2.2 Service time analysis 

The service time, which is the time from the moment a package becomes first in queue, until it is 
successfully transmitted, depends on the rate which a user exceeds the threshold and the probability 
for success (the probability which collision did not occur), given that the user has a package to 
transmit. The time between threshold exceedances is exponentially distributed with parameter t, 
due to the Poisson properties of the exceedance process (PPA). Therefore, the remaining parameter 
for the service time analysis is the probability for success. In fact, this probability is the main key 
for understanding this complex system. Due to the strong interdependence between the queues, 
this probability is constantly fluctuating and by quantifying it using average collision probability, 
one can approximate the service time. 

We start with the assumption that the users are backlogged and give the appropriate conclu¬ 
sions regarding the success probability. Then we turn to our model with memoryless arrival process. 
From now on, we will refer to the probability of collision which is the complementary probability of 
success, i.e. p co u = 1 - p SUC c- 

We define a collision scenario, if at least two users exceed the threshold u n and both has packages 
to transmit. The analysis is done by considering a specific user i at the time which he exceeds u n 
and assuming his queue, Qi, is not empty. Given that, we note p co u as the probability for collision, 

Pcoiu = P( IJ Cj(n) > u n ,Qj > 0 | Ci(n) > u n ,Q. t > 0 ) ( 6 . 2 . 1 ) 

j=l..K,jj£i 

where Qj is the number of packages in the queue of user j. In general the probability p co u depends 
on the states of the other users queues. We will analyse p co u of a specific user, starting with the 
more simple analysis, resulting with an upper bound, assuming all users always have packages to 
transmit. 

Backlogged system 

In the case which all users are backlogged, the probability for collision is constat and independent 
among the users. The state of the users’ queues does not change over time, since their queue is 
always full, and the only thing that impacts the probability for collision is whether a user exceeds 
u n or not. Then, by observing the N exc process, a certain point indicates a successful transmis¬ 
sion with probability p SU cc and a collision with probability p co u = 1 — p SU cc- Therefore a random 
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selection is made from N exc resulting with a Poisson process of successful transmissions with rate 
t • (1 — Pcoii)i which we will note as N tr - The proof is quite obvious since the probability that an 
arrival occurs from the original process in the interval dt is xdt, which is independent of the arrivals 
outside the interval, where after the random selection, the probability for an arrival in the interval 
dt is (1 — Pcoii) ■ t dt (independent of the arrivals outside the interval). Thus we conclude that 
the number of successful transmissions is a Poisson random variable with parameter (1 — p co ii) ■ t. 
Nevertheless a well appointed proof can be found in appendix [C| 

Consider the following problem under the framework of the general problem. We have one user 
with arrival process with rate A and Af^ acklogsed process with rate t • (1 — P^oii^ ogeed ), where all 
other users are backlogged. The definition of the problem in this manner will help us obtain an 
upper bound on p co u by understanding the fact that the condition of a user in the original problem 
is better, in a way that users may have empty queues and although exceedance occur it will not 
result in a collision. This setting isolates for us the other users queues’ states, from the analysis of 
the probability for collision. 


In a given time the probability that a user exceeds the level u n is 1 — F(u n ), so in order to have 
a collision, at least one user must exceed u n among all users, hence we get 


^ klOSSed = P{ U C A n ) > u n I Ci{ri) > u n ,Qi> 0) 

= 1 - P(Ci(n) < u n , C K -l(n) < U n ) 

( 1 \ K ~ x 

= l-(F ( Un )) K - 1 = l^[l--j 

*4°° 1 _ e-1 


( 6 . 2 . 2 ) 


Again the value u n is set such that only one user on average exceeds it in each slot, hence 1—F(u n ) = 

1 IK. 

This analysis is based on the fact that all the users have packages to send in any given time, so we 
can conclude that p co u < Pco« kl ° sged ■ Therefore a lower bound for the service process N tr rate is 
T-(l-p“ SSed )- 


Memoryless arrival 

The assumption of constant and independent collision probability relaxes the analysis dramatically. 
We can now assume that each user’s queue behaves like an M/M/1 queue, with a Poisson arrival 
process with rate A and exponential service time with parameter (1 — p co u) ■ r, due to random 
selection with probability 1 — p co u from the exceedance process. By doing so, we can use the known 
results for the probability of an empty M/M/1 queue, which is 

P(Q = 0) = 1 —p = l— ^ t , (6.2.3) 

where Q is the number of packages in the queue. We thus have the following lemma. 

Lemma 4. The users queue can be modeled as an M/M/1 queue with an arrival rate A and service 
rate (1 —p C oli) • x, where r is the rate of the exceedance process, N exc , and the probability for collision 
p co ii satisfies the equation 

Pcoii = l-e~ v-pcoiO-t . (6.2.4) 

Proof. As random selection (with probability (1 — Pcoii)) is preformed on the exceedance Poisson 
process, this results in a Poisson process for successful transmission, which leads to an exponential 
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service time with parameter (1 — p co u) ■ t. Let us examine the probability p co u. 
The probability that a specific user exceeds u n , and its queue is not empty is 


P(C l > Un, Qi> 0) = P{C t > u n )P(Qi > 0) 


J_ A 

I< T-(l ~ Pcoll)' 


Hence, given that one user exceeds u n and its queue is not empty, p co ii is the probability that among 
all the other users which exceeded u n , at least one has a package to send. Thus 


K-l 

Pcoll — ^ ^ 
»=1 


K - 1\ / 1 

i 


A 


= 1 - 1 - 


K—too 


l 


K X • (1 — Pcoll) 
K-l 


A 


Kx ■ (1 - Pcoll) 


—> 1 —e t "0 PcoiP 4 


1 A 

A' T ■ (1 -Pcoll) 


K-l-i 


□ 


This simple result helps understanding the behaviour of the system’s service time. Of course, 
equation (6.2.4) is an implicit equation and a numerical method is needed in order to find the value 
of Pcoll- In Figure 6.9 one can see that the numerical calculation for the success probability (the 


blue line), as given in (6.2.4), coincides with the results of the simulation and the approximate model 
from the previous section. It is important to emphasise that for this comparison, the number of 
users is relatively small and therefore we used the equation in its explicit form, meaning without 
taking K to infinity. The good agreement of the results implies that for the case which K —>• oo, 
Lemma [4] is able to describe the behavior of the users’ queues, hence the performance metrics of 
the system as a system of M/M/1 queues. Simulations for large number of users show excellent 
agreement as well, as can be seen in Figure 6.10[ where a comparison of the average service time 
was made between an M/M/1 queue with service rate Psucc^i which was calculated according to 
equation (6.2.4), with simulation of the system, which of course includes dependent queues and 
variable p succ 


P.ucc,Ar=J (1-0.001) 
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Figure 6.9: Comparison of the success probability between the analytic derivations of the approxi¬ 
mate model I, simulation results, and the approximation of K independent M/M/1 queues given in 
Lemma [4j 
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Service time \ T =0.3 


t 



■ Simulation 

■ Analytic 


Figure 6.10: The service time of an M/M/1 queue with service rate p SU cc T, compared to simulation 
results of system with K interdependent queues. 


6.3 Approximation by Constant Collision Probability - Time 
Dependent Channel 

In chapter [5] we analyzed the channel capacity, where each user experiences a time dependent pro¬ 
cess for the channel gain. There we assumed that all users are backlogged, therefore queuing aspects 
were not taken into account. We showed that under our distributed scheme the exceedance process 
of a specific user, above some threshold uk , is exponentially distributed with some rate t. This 
rate of exceedance was derived from the stationary distribution of the gain process, and there is no 
real separation between users that are in good state or in bad state in means of different rates for 
the corresponding states. The difference is embodied in the stationary distribution, thus t is the 
average rate of exceedance for such a system, which a user spends some of the time in the good 
state and some in the bad state, according to the transition probabilities. 


Then, in the beginning of chapter [6] we turned to the analysis of the queues in such distributed 

we gave an 


multiple-access system, when we assumed a memoryless arrival process. In section 6.1 


elaboration for a known approximate model which gives us performance parameters for such sys¬ 
tem, and due to its bad scalability for a large population we gave in section [672] another approach to 
approximate the users’ queues in a large population system. We show that the collision probability 
in each slot may be treated as constant, where this is called decoupling approximation, which works 
nicely when the number of users is large. Both sections’ analysis was preformed while assuming 
that the users’ capacities are i.i.d. random variables and therefore the threshold value was set 
accordingly. 

It’s an interesting question, considering the stationary good-bad capacity sequence as described 
earlier in (4.1.1), is it enough to set only the threshold value for the stationary case, as given in 


5.2.4 with the relevant normalizing constants, as given in Theorem [HJ to capture the service time 


behaviour as described in Lemma[4] We will not try to answer this question now, but rather propose 
another model which combines the queuing system with the users’ good-bad channel process. 


Assume now that each user experiences a good or bad channel, according to the time dependent 
model in Figure |4.1| Assume further that the transitions between the states are exponentially 
distributed with rates a and /3. Depending on the state a user exists in, it exceeds the threshold with 
a different rate. Namely, if a user is in a good state, it would exceed the threshold more often than 
when being in a bad state. Hence, the user’s queue is modeled as follows. A Poisson arrival process 
with rate A, and an exceedance process which is assumed to be exponentially distributed according 
to the user’s state. If a user is in a good state, the exceedance process is exponentially distributed 
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with rate fi g , and if it is in a bad state, the exceedance process is exponentially distributed with 
rate fib■ 

Since we consider multiple user system, where each user competes for the channel distributively, 
collisions may occur. Since we consider this probability as constant, we have a similar conclusion, 
that is, random selection is preformed from the exceedance process, with probability p s1icc , regardless 
the state which the user exists in. This results in an exponentially distributed service process with 
rates fi g ■ p succ or fib • Psuca depending on the good-bad state respectively. The individual queue 
model for this case is presented in Figure |6JT) 


Pcoll 


Psucc 


Figure 6.11: Queue model with varying service rates according to a time dependent channel. 


This time-dependent queue model can be represented as a continuous Markov process on the set 
of states {it^} for i S {b, g}, which indicates the good or bad state, and m = 0,1,2... the number 


of packages in the queue. This two dimensional Markov chain is presented in Figure 6.12 To ease 
notation, we denote fi' g = fi g ■ p succ and g! b = fi b ■ p SUC c- 



Figure 6.12: Queue model for time dependent user 


In 80 , the authors investigated a problem with heterogeneous arrivals and service to an M/M/I 


queue, which was solved by analysing a similar Markov chain to the one presented in Figure |6.12| We 
make use of the solution in 80 , with the right adjustment to our model. The system is solved while 
using generating-function techniques, resulting with a solution of the steady state probabilities, 
{ 7 ^}, as a function of the transition rate parameters and the root of a third degree polynomial g(z) 
(only one exist under the assumptions). Here, we will show the relevant fitted steady state results 
of the chain. First, let us define 

A = Kgfl'g + TTbflb 


where 


^Tb 


P 

a + /? 
a 

a + /3 


p is the average service rate. It is important to emphasize that in order to maintain stability of the 
system, we require that fi > A. We define the partial generating functions of the system as 


G i( z ) = 7r ™^ m \z\<l,i = g,b. 

m—0 


which eventually equals to 


G g( z ) = (£(£ - A)2 + - z )( Xz ~ Pb)) / g ( z ) 

G b (z) = (a(£ - A )z + Tr b 0 fi' b {l - z)(Xz - p! g )) / g(z ), 
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where g{z) is 

g(z) =A 2 z 3 - (ccA + /3A + A 2 + A/4 + \g! g )z 2 

+ (a/4 + Pn' g + p! g p! b + A/4 + A p! g )z — p! g p! b 

The steady state probabilities for an empty queue, depending on the system state, are 

n g = /3(A ~ a )A) 

770 - *o)(/4 - Az 0 ) 

b a(p,-X)z 0 


(6.3.1) 


(6.3.2) 


° /4(1 - zo)(v' g ~ Az 0 ) 

where zq is the root of the polynomial 3 ( 2 ). The remaining steady state probabilities are as follows 


. m— 1 m— 1 ^ 

= ^- 1-7 + to>0 

^9 i =0 M 9 j =0 ^9 

. m—1 q m— 1 

=^- 17 + m>o 

Vb j =0 ^b j=0 


(6.3.3) 


From the first derivative of the partial generating functions, we can attain the expected queue size, 
which includes the head of line package, unlike the analysis in subsection | 6 . 1 | 


Q = 


A , /4(/4 - A ) 7r 0 + VbWg - A ) ?r 0 - K - A )(X - A ) 


g — A (a + /3)(/i —A) 

Therefore, using Little’s theorem we can attain the average waiting time in the queue: 


W = 


Q 

A' 


For the completeness of the above results, we need to calculate p SU cci which until now is a fixed 
and unknown parameter. Hence, in the same manner as the previous subsection, we define the 
probability that a specific user exceeds the threshold and its queue is not empty, to be 


P t = Pr(transmission attempt) 

= Pr(exceedance occur & queue is not empty) 

= (G fl (l) - ^)(1 - e-^) + (Gb(l) - tt$)( 1 - e-^), 


(6.3.4) 


where the first term consists of the probability to be in a good state with a package to transmit, 
times the probability which an exceedance occurs while existing in the good state; The second term 
is similar, referring only to the bad state. The probability for success, given one user is about to 
transmit, can be obtained by a similar calculation as in Lemma [4j and we have 


Psucc — (1 P) 1 ) 


K -1 


(6.3.5) 


Now, in order to attain this probability, the above equation and the third degree polynomial 


(6.3.1) must be solved simultaneously, since the root Zq and p succ are coupled. As K —> 00 , the 


probability p SU cc remains constant and does not go to zero since the probability for a transmission 
attempt, Pt, scales like 0(1/AQ. This is true since in the first place, the threshold value was set 
such that on average, the probability for exceedance equals 1/A'. Thus, (1 — e _Ms ) = 0{c g /K) and 
(1 — e _/ii> ) = 0(c b /I\), where c g ,c b are some constants which satisfy c g > c b . Letting AT —► 00 , 
we get a constant probability. In Figure |6.13| one can see the probability for success p SU cc, which 
coincides with the simulation results and therefore the good agreement of the system performance 
with the simulations, as presented in Figure |6. 14 The simulations were preformed while assuming 
equal transitions rates between the good and bad state, i.e. a = /3 = 0.1. Since we are considering 
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a symmetric system, the arrival and service rate parameters, as mentioned in the figures, were 
considered as the total rates of the system and were divided equally among all the users. 
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Figure 6.13: Queuing system success probability results as a function of the number of users in a 
time dependent queue model. A t = {0.1, 0.3}, fig = 0.7, pib = 0.5 
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Figure 6.14: Queuing system performance results as a function of the number of users in a time 
dependent queue model. A t = {0.1, 0.3}, = 0.7,/if, = 0.5 
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Chapter 7 

Conclusions 


This work investigated the channel capacity and the performance of a multi-user MAC system in 
a time dependent environment under both centralized and distributed scheduling. Specifically, the 
expected channel capacity gain was derived in the case which the dependent capacity sequence 
was modeled as a stationary process characterized by the good bad channel Markov process. If 
the long range dependence condition D{u n ) and the local dependence condition D'(u n ) hold, the 
extreme laws are the same as if the sequence was i.i.d. but with different normalizing constants. A 
distributed scheduling algorithm for this system was suggested as well, and we showed that both 
approaches have the same scaling laws, the distributed approach being smaller only by a factor of 
e^ 1 , so there was no loss of optimality due to the distributed algorithm. 

The performance of the system was derived while considering queueing theory aspects. In 
fact this precise characterization is a very difficult mission, which up until today was not solved. 
Therefore we presented approximation models to describe its behaviour. First we address to the 
i.i.d. case where the users do not experience a time varying channel. For that case we elaborated an 
existing model and showed results for our paradigm. In addition we gave another approach, which 
relates the queues as independent, concerning the probability of collision in the random access 
mechanism, and enables us to consider them each as a much more simple queue. Lastly we tied the 
capacity part of this work with the queuing part when we suggested a queue model, which is time 
dependent and modeled by our good-bad channel model. We showed good agreement between the 
analytic model and the simulation results. 
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Appendix A 

Convergence to Extreme Value 
Distribution 


A.l Convergence proof 

Proof. In the following proof we give compliance for the convergence conditions and the derivation 
of ax and bx- The stationary distribution F(t) = pF g {t) + qF b {t) as shown earlier has a negative 
derivative f from xq = max{/i s ,/jf,} till oo. So we only need to show that (|5.1.2 ) holds, 


f(t)(l-F(t)) (Pf' g {t) + qfb( t )) (I - (pF g (t) + qF b {t))) 
}^o - fm - = t^So -- 


= lim 

£—>■ OO 


p(t) oo ( Pfg(t) + qfb(t)Y 

\ (pf'gjt) + qf b {t)) (pErfc (V^) + qErfc (^)) 
( Pfg(t) + qfb{t )) 2 


In 181 , 7.1.13, we can find upper and lower bounds for the complementary error function, 


V^t + Vt 2 + 2 


< Erfc(t) < 




where these inequalities are true for t > 0 which fits our case for t —* oo. Using these bounds we 
will show with the sandwich rule that the limit above converge to —1. Let us consider first the 
lower bound of the complementary error function, 


(t — flg ) 


(t-Ph) 


(pfg(t)+qf b {t)) P 


lim ■ 

£->-oo 


t-Hg , / (t-Mfl ) 2 To V^f t-Vh / O-Hh ) 2 I n 

\/ +z ,/5 5^ h/: 


V2cr 


( Pfg(t) + qfb(t )) 




= lim 


\/2 cr 


+ J ^£ t -+2 


\ i 




t->oo 


(p/sW 


t fig 


qfo{t) 


( pfg(t) + qfb{t )) 

I g s/gd) 


p 


= lim — \/2- 

£->-oo 




Qfe/b(«) 


\/2ct 


+ +2 
\Z2crjj '/ 9^-2 


(. pfg(t) + qfb(t )) 
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A.l. Convergence proof 


Appendix A. Convergence to Extreme Value Distribution 


The Limit above can be break to four different limits, 
j (I'j futfv _ a g J g (f) _ 


lim — \pip 2 


lim —\j 2 pq- 


, /(t-^ 9 ) IO 

V5^ + '/ -2 +- 




t-Hb 


<*bfb{t) 


( pfg(t) + qfb(t )) 

r ( + _ ^bfbjt) _ 

9 ' °g Mb , / (t~A* 6) 2 V 

HW + dW) 2 


+ lim — \/ 2 q 2 

t—¥ OO 


6 t — t*b 
y/2<Jh 




(. Pfg(t) + qfb(t)) 
ag/g(t) 


lim —\j 2 pq- 


•/2<t 


--^ 


(p/s(*) +9/&W) 


please notice that the first and the second limits are similar with the exception of their indexes, and 
so does the third and the fourth limit. We start with the first limit calculation. 


m ^ 


gg/gW 


lim — \/ 2 p 2 

t—> OO 


= lim —2 

t—>oo 


y/2a 


Ma + / (t-3a )2 + 2 


{pf g (t) + qfb(t )) 2 

P 2 m 


= lim — \/2 


P 2 /»W (t-Ms)o-g 


(Pfg(t) + qfb(t)Y cr| 


1 


(pfg{t)+qfb(t)f 


= , im - 2 ASM , 

^°° 0 pfg(.t) + qfb(t )) 


4/x2 

1 + A /i + 


= lim —2 

t—>oo 


P 2 Pg{t) 


r fftt) 

= lim — --- 

(pfg(t) + qfb(t)) 


7 • lim 

( pfg(t) + qfb(t )) 

- = - ( lim _^_ 

2 l ‘-°°(p/ 9 (t) + ?/ 6 (t)) 


4<^ 

(*—Ms) 2 

1 


1 + 


4cr g 

(*—Ms) 2 


== — I lim 


1 


1 + 


gh(t) 
Pfg(t) 


1 + lim^oo 


qfb(t) 

PfgW 


y qfb(t ) cr 9 g + cr g9 

inn ——— = — - lim e <> » = —-e 

0 -& p 


lim, 


t^00pfg{t) <T b pt-HX, 


lim 

t—>oo 


K - <) + - 2/x g of) + C 


2 a? 


So, 


b”g 


qfbit) 

lim t 77 \ 

PJgV') 


OO 
—OO 


OO 

0 


< °l 


(t-Vb) 2 i (t-Hgr 

-1 TZ?— 


<?n < 


a 2 > cr£ assuming /x g > /Xb 


> 05 assuming p, g > (Xb 


Note that in (a) we assume that p 0. This assumption implies that the situation which all the 
users are in the bad group is not taken in consideration. For that case all the users have the same 
channel and the analysis is known and not in our interest. In the same manner we assume also that 
q ^ 0 for the opposite situation. Hence, the first limit result is 


fg(t) 


t~Pg 


g g/g(*) 


lim —\/ 2 p 2 

t—> OO 


V2o 


L + 


(t-Hg) 2 , o 

2 it 2 +2 


( Pfg(t) + qfb(t )) 


0 

-1 


< °b 


a 2 > a b assuming p g > p b 


As mentioned earlier the first and the second limits different only in their indexes, therefore the 
result for the second limit is 


fb(t) 


t-Pb . 


<?bfb(t) 


lim —\/ 2 q 2 - 

t—> OO 


\/ 2 cr 


2 


+2 


C pfg(t)+qfb(t)) 


-1 
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a \ < a l 

a 2 > al assuming p g > p b 
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We turn now for the third limit calculation 
J a bfb jt) _ 


lim — \/2 pq- 

£—>■ OO 


= -2% lim 


V2cr 




( pfg{t) + qfb(t )) 
Pfg(t) 


• lim 


= lim —y/2 

t—> OO 


qfb(t) 


pqfg{t) (t- n g )a b fb(t)V2a b 


= -°\ lim 


0-2 t^oo ( pfg(t) + qf b (t )) t-s-oo ( pfg(t) + qf b {t )) t->°o (t-fi b ) t-> 00 
Pfg{t ) qfb(t ) 


{pfg(t) + qfb(t)) 2 a 2 g (t-Ll b ) ^1 + ^1 + {++) 
. lim (t ~ + . lim 1 


1 + \/l + 


4g -g 


• lim 


0-2 ( pfg{t)+qfb(t )) t-*-oo ( pfg{t) + qf b {t )) 


Cg — assuming fi g > p b 


a g < 

cr 2 > of assuming fi g > fi b 


= 0 


The fourth limit also share the same result, and if we add all the parts we can see that the limit 
converge to —1. If we consider the upper bound of the complementary error function we will find 
also that the limit convergence to —1 since the analytical development is the same and the limit 


lim 

£—>■00 


1 


1 + 



4g g ^ 


lim 

£—>■00 


1 + \/1 + 


T (*-Msh 


1 

2 


Therefore we can conclude that condition (5.1.21 holds for our stationary distribution. 


□ 


We now show that the second condition (5.1.3) also holds for the stationary distribution. Let 
us examine the expression: 


1 - F(t) = P ( 1 - F g [t)) + q(l - F b (t )) 

Since Fi(t) is a Gaussian distribution we shall use the asymptotic relation: 

, , (hit) 

1 — <h(f) ~ —— as t — > 00 


1 - F(t) = p ( ———) ) + q 

\t-pg a g 


where (a) is true since 


1 


\t~ Pg 

(a) 1 P 

V2nt- fig 


_ U-Ms) 

P e + 


&b i/i Pb\ 
7- <P{ -) 

t fJjf) (J 5 

q -e^r) 


t - fi b 


_ (t~Pg) Z 

e 2a 9 (l + o(l)) as t —» oo 


jt-Fb) 


(t-Hq) 2 


9.rr 2 
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£—>■00 


= lim 


£—>00 _ tt — Pg ) 2 
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= 0 


(A.1.1) 
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A.2. Normalizing constants 
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assuming a g > cr b . So taking in consideration condition (5.1.3): 


1 -F(t + xg(t)) _ ^ t+xg{t)-ti a e 


(t+Xg(t) — fig) 
9.rr‘ 2 


(1 + 0 ( 1 )) 


i - m 




+ (1+0(1)) 


y/2n t—^g 

J. .. — (t + xg(t) — fig ) 2 + (t — g.g)^ 

e -i (l + o(l)) 


t + xg(t) - g g 
1 


1 


*g(t) 

t-llg 


_s(t)i(i-/i 9 ) _a 2 (t)x 2 

e ^ e (l + o(l)) = 


By choosing g(t) = t °_ s ^ as the strictly positive function for t —» oo we get 


1 


1 + 


_g x e i(t-g g )' 2 (]_ _|_ 0 (1)) = e 


as t —> oo 


(‘-Mb) 2 

That conclude that the distribution function F(x) belongs to the domain of attraction of Type I. 
Similar analysis can be found in 82 , where some examples for convergence of sequences of inde¬ 
pendent random variables with the same mixed distribution is investigated. 


A.2 Normalizing constants 

Derivation of ax and bx- 

From Theorem [2] we know that ux is a sequence of real numbers such that K( 1 — F(ux )) — > T as 
I\ —>• 00, therefore in our case: 


1 -pFg(ux) - qF b (ux) -+ -j^e 


K -4 00 


where t = e x . The same way as the previous proof using (|A.l.l|) we obtain that 
pcr g 


u K - p g 


-<j>( ———) ) + 


———(/>(———) ) -»■ —e~ x 

u K - Pb &b J K 


1 


1 n 1 

-7=—e ^ (1 + 

+ 27r u K - Mg A 

the last step is true since ux -+ 00 as AT —► 00, similar to the pervious proof. 


- * log 27 t + logp - log (u K - Mg) - (yUK 2a 2 ^ + log K + x + o(l) -» 0 
It follows at once that (t — g g ) 2 /2\ogK —>• 1, and hence 


(A.2.1) 


log (u K - Hg) = -(log 2 +log log AT)+ o(l) 


Putting this in (A.2.1) ,we obtain 
(u K - Mg) 2 




= log27T + logp - ^ (log 2 + log log A") + log K + x + o(l) 
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or 


and hence 


(u K - Hgf 


= 2 log K 


x- \ log p - \ log log K / 1 


log A' 


log K, 


( 1 + *_z]^pp^ + Jl \ 

a g \ 2 log AT V lo g K J / 


so by using expansion we have, 


- - x ~ \ log p- — ^ log log K f 1 

uk = 0W21og K 1-1 - 


2 log K 

\ 

since we know that uk = x/gk + bx we conclude that 

y/2 log K 


V log K 


llg 


a K = 


b K 


^(2 log K) 1 ' 2 


log log a: + iog||\ 

2(2 log A') 1 / 2 J +Ms 
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Appendix B 

Capacity - Direct expression 
analysis 


B.l Capacity - analyzing change in group sizes 

The system can be modeled in a different way, still taking in consideration our channel model, we 
can analyse the channel capacity by addressing separately to the different groups which are differ 
by their channel state and evaluate the maximum for each group separately. Therefore the system 
is modeled as a first order time-homogeneous Markov chain with K + 1 states, each state represent 
The number of users which are in G state or in B state. In other words, The state Si(n) represent 
the system state in discrete time n, where i users are in Bad channel state and K — * users are in 
Good channel state. 

The Markov chain is irreducible and aperiodic chain. We can notice that it is possible to go 
with positive probability from any state of the Markov chain to any other state in a finite number 
of steps since the chain is a complete graph. And it is easy to see that since each state has positive 
probability to remain at the same state all the states are aperiodic, 
we will give now some properties for the system behavior which will be used later. 


B.1.1 Transition probability matrix 

The transition probability matrix P is given: 


nin {i,K—i,j,K—j} 


P = 

1 13 


E if*<j 


n—0 

min {i,K—i,j,K—j} 

E 

n =0 


(K-i\ 


-j+nJ 


if i 

<3 



for 0 < 

i,j < K 

if i 

>3 




(B.1.1) 

state 

from G to 

B or the 


opposite as shown in Figure 4.1 


The entries in the transition matrix are binomials probabilities, the probability to get from state 
i to state j depends on the number of users which in a given time n changed their channel state. 
The probability Pjj sums up all the probabilities of the cases that eventually lead to having j users 
in bad state and K — j users in good state. Each case describes the number of user transition from 
good to bad state with success probability a and the number of user transition from bad to good 
state with success probability /3 in a given time n. 
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Proposition 1. The number of transition cases possible, which begins in state i and ends in state 
j in a given time n, are min{j, K — i,j, K — j} + 1. 

Proof. The initial state i represents that there are i users in the bad group and K — i users in the 
good group and the target state j represent that there are j users in the bad group and K — j users 
in the good group. For example, the number of possible transitions, for K = 6, i = 2, j = 3, is 3; 
the first one is the transition of one user from the good group to the bad group and none from the 
bad group to the good group, the second is the transition of two users from the good group to the 
bad group and one user from the bad group to the good group and the third one is the transition 
of three users from the good group to the bad group and two users from the bad group to the good 
group. Another example, if i or j are 0 or K we would have only one possible transition. 

The expression min{i, K — i} defines the maximum transitions possible from the group regardless 
the target state j, there can’t be more transition cases than the size of the smallest group. The 
expression rnin{;;, K — j} defines the maximum transitions possible in order to obtain the new sizes 
of the groups, since again we cannot count more transition cases than the size of the smallest group. 
Finally the minimal value between the two should be chosen, so it would hold for both conditions. 
Hence, we get : min{min{*, K — i}, min{j, K — j}} = min{i, K — i , j, K — j} □ 


Let us consider the assumption that the transition probabilities in the good-bad channel are 
symmetric (a equals /3). 

The transition matrix Pj 3 under the condition a = (3, 

! min {i,K—i,j,K—j} 

E (/yj - a 

n—0 

min {i,K—i,j,K—j} 

E - 

n—0 


(B.l.l), can be written as follow: 


jK-j+i-2n if i < j 

- a) K ~ l+: ’~ 2n if i > j 


for 0 <i,j<K 


Looking on the structure of the equation above we can notice that Pij = P(K-i)(K-j) assuming 
i < j and without loss of generality. In both entries we get the probability: 


min {i,K—i,j,K—j} 

E 

n—0 


K-i 
j — i + n 


j-i+2n 


(1-a) 


K—j+i— 2n 


This applies also to the case where i > j. 

So the matrix has a structure as shown below: 


( a 0,0 

° 0,1 

Oo,K-l 

Oq,k\ 

« 1,0 

0-1,1 

Ol,K-l 

O-l,K 

o-l,K 

Ol,K-l ' 

Ol,l 

a l,0 

\ a 0,K 

Oq,K-1 ■ 

° 0,1 

a 0,0 ) 


Proposition 2. The transition matrix P is a Centro symmetric (Cross-Symmetric) Matrix. 

Proof. An ( n x n) matrix A = [Aij] is centrosymmetric when its entries satisfy Aij = A n -i + i^ n -j + i 
for 1 < i,j < n ( [83]). In our case, 0 < i, j < n and the matrix is ((n + 1) x (n + 1)), so we would 
get that the entries satisfy A l :j = A n _ i n _j for 0 < i, j < n. This consist with the structure of the 
transition matrix explained above. D 


Let us consider the (n x n) exchange matrix J, which satisfies I 2 = J where I is the identity 
matrix, and it entries defined such that: 


J%,j — 


1 if j = n — i 
0 if j ^ n — i 


for 0 < i, j < n 


Proposition 3 ( 83 ). An (n x n) matrix A is centrosymmetric if and only if J A = AJ . 
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B.1.2 Stationary distribution 


If a Markov chain is a time-homogeneous, so that the process is described by a single, time- 
independent matrix P tl then the vector 7f is called a stationary distribution and it satisfies the 
global balance condition : n = nP. 

In the case that the finite state Markov chain is irreducible and aperiodic, then the stationary dis¬ 


tribution exist and it is unique 84 


The characteristics mentioned above exist in our system model. We will investigate our stationary 
distribution If. 


Theorem 9. The stationary distribution 7 r of the Markov chain with transition matrix P is a 
symmetric distribution around its middle. 


Proof. Let 7r be the unique stationary distribution of the Markov chain which hold the global balance 
condition: 

7f = TfP 


Jfj ( = } TfP J 

M = 7f PJ 
7 T* = IfJP 


* —^ .rr* P 
7T = JT r 


Where (a) since the equation hold we can to multiply both sides with the matrix J as defined earlier, 
(b) any vector v multiplied by the matrix J turns to be the same vector v in its revers order, which 
mean v* = (c) P is a centrosymmetric matrix and according to proposition |3j PJ = JP (d) 

same as (b). 

As a result we can see that the vector 7r* satisfies the global balance equation therefore in order not 
to contradict the singularity of the stationary distribution the obvious conclusion is that w = n*. 
From this we can conclude that the stationary distribution is a symmetric distribution. □ 


A random variable £ and its distribution function F(x) are said to be unimodal with mode M if 
F{x) is convex on (— oo,M) and concave on (M, oo). The notion of unimodality has proved to be 
very fruitful, therefore its analogs have been introduced for discrete and multivariate distributions. 
In [85) some results for discrete unimodality can be found, we will consider some of these results for 
better understanding the stationary distribution If. A distribution {p n } with fc £ Z as its support 
will be said to be unimodal if there exists at least one integer M such that: 


Pn>Pn- 1 , for all n < M, 
Pn+1 < Pn , for all n > M. 


The distribution may consist of a single point mass, or it may be bounded on one side, both sides, 
or be unbounded. A discrete distribution {h n } is strongly unimodal if the convolution of {h n } 
with any unimodal {p n } is unimodal. In addition, a necessary and sufficient condition that {h n } is 
strongly unimodal is that ( [85| , Theorem 3): 

h n 2 > h n _ih n+ i for all n 


Another theorem presented is: The limit of a convergent sequence of strongly unimodal discrete 
distributions is strongly unimodal ( |85| , Theorem 1 ). 

Also, it has been proven that any binomial distribution is strongly unimodal since it is a convolution 
of a finite number of Bernoulli distributions which are strongly unimodal. This derives from the 
Lemma given in 


85 , that the convolution of two strongly unimodal discrete distributions is strongly 


unimodal. 


Theorem 10. The stationary distribution n of the Markov chain with transition matrix P is a 
discrete strongly unimodal distribution. 
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Figure B.l: Stationary distribution of the Markov chain system model. Where the transition 
probabilities in (a)a = /3 = 0.1 (b)a = 0.2 /3 = 0.1 (c)cc = 0.3 /3 = 0.1 


Proof. Since the stationary distribution is defined as the limit distribution in an ergodic Markov 
, Theorem 59), according to ( [85], Theorem 1), in order to prove that W is a discrete 
strongly unimodal distribution it is sufficient to prove that each row of the transition matrix P is 
strongly unimodal. 

Let us consider the following situation to describe the system model. Given the system state is 
i (i users in the bad group and I\ — i users in the good group), we define two binomial random 
variables, A' ~ B(K — i,a) and Y ~ B(i,a) which represent the number of users that traversed 
from the good group to the bad group and from the bad group to the good group in a time unit, 
respectively. Therefore, the distribution for the next system state J, i.e. the row i of the transition 
matrix P is: 

Pr{ J = j) = Pr(i + X -Y = j) = Pr{X -Y =j - i) 

The next system state is ruled from the given state i the amount of users that traversed to the bad 
group minus the users that traversed to the good group. The difference between the two binomial 
random variables X and Y gives the next state distribution, hence we only need to show that this 
distribution is strongly unimodal. In the case of sum of two independent binomial random variables 
it is very simple, since each of them has strongly unimodal distribution and the distribution of the 
sum is the convolution of the two strongly unimodal distributions which is strongly unimodal. Let 
us define a new random variable W such that W = —Y. The distribution of W is defined: 

Pr(W = —k) = Qa fc (l-a)N for fc = 0,1,2,...,* 

This distribution is clearly strongly unimodal since the necessary and sufficient condition in 
Theorem 3) holds. For Y, from its binomial characteristics, the inequality p'f. > pk+iPk-i 
Since p(Y = k) = p(W = —k), p{Y = k + 1) = p(W = —k — 1) and p(Y = k — 1) = p(W = — k + 1), 
the inequality p 2 _ k > p-k+iP-k-i for W holds as well. 

Therefore, we get that the distribution: 

Pr{ J = j) = Pr{ X — Y = j — i) = Pr(X + W = j-i) 


( !>]> 
holds. 


chain ( 84 


is strongly unimodal, since X and W each has a strongly unimodal distribution, the convolution 
between them will also give a strongly unimodal distribution 85 . □ 


From Theorems[9]and[T0] we can conclude that the stationary distribution of the Markov chain is 
a symmetric unimodal function. See Figure B.la for the case of symmetric transition probabilities. 
We will derive the capacity for this scenario given the current state of the system, while this state 
determines the amount of users experiencing the different channel distributions. 


B.l.3 Capacity analyze 


We consider the problem of finding Mk as defined in (5.0.11 and the distribution of {C} as defined 


(4.1.1). As mentioned earlier at first we will assume that the system is in a specific state Si(n) and 
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therefore we know the amount of users which are in good and bad channel state. It is important to 
understand that given the system state Si(n ), there are i users in Bad channel state which we denote 
by K\, and K — i users in Good channel state which we denote by A' 2 , note that A^= K\ + K 2 . 

In order to find the expected capacity we shall derive the capacity distribution of Mk'- 


k 


P r (M K < x) = ^P r (Mx < x | Si(n))P{Si(n)) = 

2=0 

K 

P r (max {Ci(n), C 2 (n), C K (n)} < x \ Si(n))P(Si(n)) 


(B.l.2) 


2=0 


Given Si(n) we get two independent groups A = {<21, ...,0^} and B = {bi, ..., 6 k 2 } where aj and 
bj represents the users’ capacities according to the group each user belongs to. Since every state of 
the system need to be taken in consideration, we cannot use the EVT for all the group sizes. Let us 
define a parameter ip for the minimal group size which we can use EVT to achieve the asymptotic 
distribution for the maximum capacity. Assuming A and B are big enough, the distribution of the 
maximum capacity in each group is the asymptotic distribution Gj(x,Ki) as in equation (3.3.3), 
where the parameters cik and bx will be a function of A'i and A'2. And for the states where one 
group is too small for us to use EVT we will treat in a straightforward manner. 

The distribution given the system state Si(n ): 

P r (M K <x | Si(n)) = P r (max{C'i(n),C' 2 (n),...,C'i<-(n)} < x \ Si(n )) 

= P r (max{ max{A}, max{P}} < x | Si(n)) 

= P r (max{A} < x,max{P} < x | Si{n )) 

== P r (max{A} < x | Si(n))P r (ma,x{B} < x | Si(n )) 

where (a) is due to the independency between the groups. So ( B.1.2| ) can be written: 

K 

P r {M K < x) = P r (max {A} < x | Si(n))P r ( max{P} < x | Si(n))P(Si(n)) 


i=0 

(p K—ip K 

= {x)G 2 (x 1 K 2 )+ Y, G 1 ( X ,K 1 )G 2 (x,K 2 )+ Y G\(x, K 1 )Fq 2 (x) 

2=0 i=(p -\-1 i=K—(p+l 

(B.l.3) 

Thus the expected channel capacity is: 

K 


E 


M k 


= Y e m k I Si(n) P(Si(n)) 


2=0 

<P 


d r. 


= ^P(5,(n)) / x— F^(x)G 2 (x,K 2 ) 


2 = 0 


K—ip 


dx 


+ ]T P(Si(n)) 


i=cp+l 

K 


dx - 

xP G 1 (x,Ki)G 2 (x,K 2 ) 


d 


(B.l.4) 


+ ]T P(Si(n)) I x—^&K^F^ix) 


i=K-<p-\-l 


dx 


dx 


In Figure [B~2] we can see the capacity PDF in the case N\ = 150 and N 2 = 150 where each group 
experience different channel distribution with = 1.25,/X2 = 0.25 and ay = <j 2 = 0.19 respectively. 
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«!> 



Figure B.2: Capacity Distribution with Two Groups distributed differently 



Figure B.3: Capacity distribution for variable size of Groups distributed differently. Where the 
transition probabilities in (a)a = /3 = 0.1 (b)a = 0.1 j3 = 0.4 


A more realistic simulation is in Figure [B~3| we can see the maximal MIMO capacity among 300 
users, with time variable size for each group (bad or good), and time variable state for each user 
according to the good bad channel Markov model. Achieved by saving a state vector for the users 
and updating it following the transition probabilities for each drawn. 


B.1.4 Capacity Bounds 


Finding a closed solution for the expected channel capacity for the expression given in (B.1.41 is 


very complicated, because each Ci{n) is determined according to a Markov process. One can think 
of some upper and lower bounds for the expected channel capacity in the analytical development 


in equation (B.1.4), the first naive lower bound would be to choose i such that gives the maximum 
probability from the stationary distribution 7f of the system model (figure |B.lc| ) in order to simplify 
the expression in (B.1.4), once we know the specific i we know the parameters K i and I \ 2 : 


E 


M, 


K 




K 


M K 7Tj 


i=0 


(B.l.5) 


(b) 

> maxl^} E Mx 
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where (a) is from equation (B.l.4) and (b) is to choose the maximum probability in 7r, note that 

P(Si(n)) = 7 Ti . 

The index i which satisfy the maxi{7q} in the case where the transition probabilities in the good- 
bad channel are symmetric is i = ^ as shown earlier in the analysis of the stationary distribution 
of the system. 

Since i = A which mean that the groups are still big as K —► oo, its fair to say that the EVT 
can be applied on the distribution of the two groups. 


Therefore, we can achieve a closed bound for this case by setting i = A in equation (B. 1.5): 


E Mk > 7r k_ E M x I 7 tk 


d 7 K ^ / K ^ 
= 7Tic / X— G 1 (x,—)G 2 {x,—) 


— oo 

OO 


(<0 

= 7 Tk / X 


dx . 

d r 


dx - 


( Gi ( a :,|)) 2 


dx 


dx 


OO 

J x il( 


a K v 2-i 

^- e 2 ' 1 dx 


(B.l.6) 


= 27 tk xe 


dx 


= 27 t k (( 1£7 + bK ) 

9^9 ' 


a, k_ 


oo 

= 27 tk [ (as In (-) + 6 k )e~ v dy 

j y 


where cik and bi< as defined in (3.3.6) and (13.3.7). 

2 2 - >- 

The first lower bound given above can be improved. In this bound we only took in consideration 


the maximum probability from the sum of probabilities in equation (B.l.3). As mentioned we Know 


that the two groups sizes are 0(K/ 2), hence EVT can be used, and another bound can be taken in 
consideration which includes only the states that represent similar group sizes. Let us consider the 
situation that there is more good users than bad users, i.e. ^ — 5 bad users and + S good users, 
where S can be chosen as we please in order to maximize the offered bound. Now if we discard 26 
users from the good group we would get a lower bound for the expected channel capacity, since the 
users who give the most contribution are the good users, so it is clear that if we reduce these users 
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we reduce the expected channel capacity. 


K 


E 


M k 


M K 7 Ti 


Mk I Ki 


= £ E [ 

2—0 

> t E [ 

i=§-6 

k_ 

2 

= ^2 E[G 1 (x,K 1 )G 2 (x,K 2 )]n i 

i=§-S 

K_ 

2 

> ^2 E[G 1 (x,K 1 )G 2 (x,K 1 )}TT i 


i=4-<5 


= ^2 ( a Kil + bi< 1 )n i 

*=§-<5 

> + n f-s S 


(B.l.7) 
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Appendix C 

Random selection Proof 


Let A be a Poisson process with intensity A and N* be a point process obtained from N such that 
each event is removed or retained independently with probabilities 1 — p and p respectively. So for 
any Borel set B , 


P(N* (B) = k) = J2 P(N(B) = i)P(N* ( B) = k\N(B) = i) 


i—k 

oo 

= £ 




i\ 


i=k 

e -P^ B )(p\(B)) k 

k\ 


p\l-p) 


i—k 


so that N*(B) is a Poisson random variable with mean p\(B). 
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Appendix D 

System-status chain transition 
probabilities 


Here we show the calculation of the transition probabilities of the system chain. We use the 
guide lines in [3] to simplify things and use also their notions but under our case which state that 
ri = Si = qi = Pi. Due to this fact there are many more possible transition is the state space and 
therefore the following calculations would be more complicated. In the same manner the calculations 
are based on the change in the number of active users, thus the transitions can be classified into 
four types as follows: 

1. The number of active users transits from 0 to 1: 

Only a blocked user may become active, and only one, say j. All other blocked blocked must 
not exceed the threshold. In addition all idle users which may receive a package must not 
exceed the threshold and therefore, say w, becomes blocked. The latter will repeat it self 
during this calculation. The probability of such a transition is expressed by: 

K—n—w— 1 w n+1 

p(aa =i, ab = —i+w, a/= —w) = n 

k l i p i 

where the product is on the sets of users by their transition and n is the number of blocked 
users after the transition. Note that the user j will be active if his queue is not empty after 
the successful transmission or a package arrived in the beginning of the slot. Also note that 
p and A stand for 1 — p and 1 — A, respectively. 

2. The number of active users remains 1: 

This case divides to two subcases, the same active user j remains active or another blocked 
user s becomes active. Nevertheless w idle users may still become blocked. 

(a) the probability for the first case is: 

K—n—w— 1 w n+1 

P(AA = 0, A B = +w, A I = -w,j -> j) = [] A, JJ A ,p t ]Jp^ [+(1|1) + A,(1- P,(l|l))] 

k i i p o 

(b) the probability for the second case is: 

K—n—w— 1 w n+1 

P(AA = 0, AB = +w,AI = -w,j ->s)= A fc JJ A;p z [P s (l|l) + A s (l - P s (l|l))] 

k i i Ps 

3. The number of active users transits from 1 to 0: 

This case is subdivided to three subcases, 
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(a) The first subcase is that the active user j becomes idle: 

K—n—w— 1 w n+1 

P(AA = -l,AB = +w,AI = -w) = J] 

k i i p i 


(b) The second subcase is that the active user j becomes blocked and blocked user s becomes 
idle: 

K—n—w— 1 w n+1 

P(AA = -l,AB = +l-l+w,AI = +l-w) = JJ XkHXm \{Pi^ [A s (l - P s (l|l))] 

k i i Ps 

(c) The third subcase is that the active user j becomes blocked: 

P{AA = -1, AB = +1 + w, AI = -w ) = 

K—n—w— 1 w n+1 

n k n Xip i n ^ 

k l i 

K—n—w —1 w K—n—w— 1 * n+1 

+ n e +np. 

k l s s i 

K—n—w— 1 w / w \ / n \ 


n +II A M 1 -n^ '-lb 


K—n—w —1 w 


11 A feII A ^ 


K—n—w —1 n w 


n n 




+ n 


Pc 

H 

W W 


1 ~T[ps-PiT[pbJ2jr 

s b a F <1 


The first expression is for the case which non of the users exceeds the threshold. The 
second is for the case which one idle user managed to successfully transmits. The third 
case describes the case which one or more from the blocked and the idle groups tries 
to transmit, therefore what happens with user j is meaningless. The forth expression 
describes the situation which user j collided with one or more users from the blocked 
group, or collision happened between the blocked users and j didn’t exceed the threshold. 
The fifth expression is the same as the fourth concerning the group of idle users which 
receives a package. 


4. The number of active users remains 0 : 
This case is subdivided to three subcases, 


(a) The first subcase is that all users maintain their status without change: 


P{AB = 0, A I = 0) 




Pq 

P q 


K—n n K—n K—n 


n A fc+n+ n Afe e 


\Pq 

\ 


where the first term is the probability that no packet arrives at the idle users while 
no blocked user, or at least two blocked users, transmit, and the second term is the 
probability that no blocked user exceeds the threshold while only one of the idle users 
successfully transmits. 

(b) The second subcase is that one blocked user j becomes idle: 


P(AB 


— 1 + w, AI = +! — «;) 


K—n—w —1 w n+1 

n ^ n Xip i n ^ 

k l i 



Pj( 0|2) 
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(c) The third subcase describes some situation for w idle users becomes blocked: 

P(AB = +w, AI = —w) = 

K—n—w w I" n n 

II 

k i L i q 

K—n—w w I" w w w 

+ n 

fc ! L s b q 

K—n—w w w I" n 

+ n 

fc ; ? ^ L * 

K—n—w K—n—w , w n 

+ n ^ e 

fc s s l i 

The first expression is for the case which non of the u, 1 users exceeds the threshold and no 
blocked user, or at least two blocked users, transmit, the second is for the case which at 
least two users from w exceeds the threshold, the third case describes collision between 
one of the users from w with at least one from the blocked users and the forth expression 
describes the situation which one idle user succussed to transmit while all the other does’t 
exceed the threshold. 

According to these transition probabilities we can calculate the steady state of the chain given the 
auxiliary quantities p,(l|l) and p.j(0|2), the probability of exceedance and the users arrival rates. 
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Appendix E 

Average success probabilities 


Once the steady state of the system chain is known, the average success probabilities can be calcu¬ 
lated. We do so by calculate the values of Pb(j),Pa(*) and Pi(i) in the same manner as calculated 
in j3 while taking in consideration that user must exceed the threshold in order to transmit. 

1. For Blocked user i: 

, . _ , . Pr( user i success and user i is blocked) 

PbW = Pr (user i success user i is blocked) = -— - ; ———-——- 

Pr[ user i is blocked) 

where 

Pr (user i success and user i is blocked) = pi E P(S 1 , S K ) n(A J p j +A i )' 5s J -° 

Si= 2 

Sj—0,1,2 

The exponent 5 equals 1 when it’s condition holds. 

Pr{ user i is blocked) = E P(S 1 ,..,S i ,..,S K ) 

Si—2 

2. For Active user i : 

. . . . Pr( user i success and user i is active) 

Paw = Pr {user i success user i is active) =-- ; - ; —r- 

Pr{ user % is active) 

where 

Pr( user i success and user i is active) = Pi -P(<Si ,-,s K ) l[^jP j + x ^ Si=0 (p j ) 5Si=2 


the difference here is due to the fact that no more than one user may be active. 

Pr (user i is active) = ^ P(Si,Si,S k) 

Si =1 


3. For Idle user i: 

Pj(i ) = Pr (user i success | user i is idle) 


Pr (user i success and user i is idle) 
Pr {user i is idle) 
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where 


Pr( user i success and user i is idle) = \pi ^ P(SSk) ]^[(AjP J -t-Aj)' 5s J -0 (pj)' 5 A _1 (p J -) l5s i- 2 

Si= 0 jjii 

Sj—0,1,2 


and 

Pr (user i is idle) = ^ P(S\,Si,S k) 

Si =o 


Using the above success probability it is not difficult to attain the expression of (6.1.61, (6.1.71,(6.1.8),(6.1.9),(6.1.10) 
from the steady state of the queue-length Markov chain. After knowing the these values the bound¬ 
ary conditions can be calculated: 


m 11) 

Pd 0 I 2) 


: _ tt(1.1) 

Gi(l)- 7t(1,0) 
7T(0,0) 

Gj(l) 
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